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An Infectious Clone for Human Parainfluenza Virus Type 3 



This work was supported in part by National Institutes of Health Grant No. 32027 from the 
Institute of Allergy and Infectious Diseases. The government may have rights in this invention. 

BACKGROUND 

Recognized in 1956 as a cause of respiratory infection in man, human parainfluenza viruses 
5 (HPIV) are believed to account for 4 to 22 percent of the respiratory illnesses in children, second only to 
respiratory syncytial virus in this regard. HPIV are important causes of the lower respiratory tract diseases 
such as pneumonia, and bronchiolitis, and are the most common cause of croup in young children. Of the 
four HPIV serotypes, 1-4, type 3 virus (HPIVo), appears to be the most virulent, frequently causing 
bronchiolitis and pneumonia during the first month of life. 

10 Unfortunately, effective vaccines or antiviral therapies, which can be used to prevent or treat 

HPIV-induced infections, are not presently available. Standard methods which are used to produce inactive 
viruses, such as heat inactivation or chemical treatment of the virus, have been unsuccessful with all HPIV 
strains and serotypes, including HPIV-3. Moreover, standard methods for producing attenuated viruses 
produce mutations at random sites and do not allow one to modify the HPIV genome at specific sites or to 

1 5 control the number of mutations that are introduced into genome. 

Human parainfluenza viruses are enveloped, single-stranded, negative sense RNA viruses that are 
members of the param>Tcovirus genus within the family Paramyxoviridae. Replication of the human 
parainfluenza viral genome (vRNA) is similar to that of other members of the Paramyxoviradae family. 
Upon infection of a cell, transcription is the major RNA synthetic event, resulting in the production of the 

20 viral mRNAs from the negative-sense genome, i.e., the vRNA. Later in infection a transition to RNA 
replication occurs, resulting in synthesis of a full-length, antigenomic, positive-sense RNA, which serves as 
the template for synthesis of additional negative-sense genomic RNA. Transcription and replication of the 
genomic RNA is dependent upon formation of a ribonucleoprotein complex (RNP) consisting of the 15462 
nucleotide genomic RNA encapsidated by the nucleocapsid protein (NP), and the closely associated 

25 phosphoprotein (P), and the large (L) polymerase protein. Several host cell factors are also involved in the 
replicative cycle of HPIV. 

The requirement for an intact RNP for HPIV has hindered analysis of HPIV transcription and 
replication in a cell-free system. Efforts to encapsidate HPIV-3 vRNA in vitro have failed, and unlike the 
positive sense RNA viruses, naked HPIV vRNA is not infectious. Moreover, there currently are no known 

30 systems for preparing recombinant HPIV, including recombinant infectious HPIV-3. 
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Accordingly, there is a need for new reagents, systems, and methods that enable one to produce a 
recombinant HPIV, particularly a recombinant, infectious HPIV-3. Recombinant systems that permit one 
to introduce one or more site-specific mutations into the genome of HPIV, particularly HPIV-3, are 
desirable. Recombinant systems which allow one to characterize the effect of site-specific mutations on 
the transcription or replication of human parainfluenza viral RNA and to identify the site specific mutations 
which lead to the production of attenuated HPIV are especially desirable. 



SUMMARY OF THE 1>JVENTI0N. 

In accordance with the present invention a system for generating recombinant, human 
parainfluenza virus, particularly infectious, recombinant, human parainfluenza virus type 3 (HPIV-3) is 
provided. In one embodiment, the system comprises a clone comprising a nucleotide sequence that 
encodes a full-length, positive sense, anti-genome of HPIV, and at least one support clone comprising a 
nucleotide sequence that encodes the HPIV P protein and the HPIV L protein. In another embodiment, the 
system further comprises a support clone which comprises a nucleotide sequence that encodes the HPIV 
NP protein. Preferably, each of the clones in the system comprises an RNA polymerase promoter which is 
operatively linked to the respective HPIV nucleotide sequence contained within the clone. 

The present invention also provides a clone which comprises a nucleotide sequence encoding the 
full-length, positive sense, anti-genome of HPIV-3. The clone also comprises an RNA polymerase 
promoter operatively linked to the HPIV-3 antigenome-encoding sequence. In a preferred embodiment, 
the clone further comprises a nucleotide sequence which encodes a ribozyme immediately downstream 
from the sequence encoding the HPlV-3 anti-genome. 

The present invention also relates to a method of preparing recombinant HPIV-3 virus having site- 
specific mutations in the HPIVo genome. The method comprises preparing a clone comprising a modified 
HPIV-3 antigenome-encoding sequence; introducing the modified HPIV-3 clone and support clones which 
comprise nucleotide sequences encoding an HPIV-3 P protein, an HPIV-3 L protein, and, preferably, an 
HPIV-3 NP protein into host cells.; and culturing the host cells under conditions that allow for synthesis of 
the modified HPIV-3 antigenome and the L, P, and NP proteins of HPIV-3, 

The ability to produce recombinant, HPIV-3 virus genetically engineered to contain site-specific 
mutations within the HPIV-3 genes and cis-actmg elements expedites the study of all aspects of the virus 
replication cycle. Additionally, a system which permits production of recombinant HPIV that is 
genetically engineered to contain site-specific mutations within the HPIV-3 genome is useful for 
identifying attenuating parainfluenza genotypes and for developing a live vaccine for human parainfluenza 
virus. 



BRIEF DESCRIPTION OF THE FIGURES 
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Figure 1 depicts the DNA form of the nucleotide sequence of the HPIV-3 genome and shows the 
location of restriction sites, the leader sequence, the trailer sequence, and the protein encoding regions of 
the genome. 

Figure 2 is a restriction map of the pOCUS-2^" vector. 
5 Figure 3 is a restriction map of pMG(+) showing the location of the leader sequence, luciferase 

encoding region, and theT7 promoter and terminator. 

Figure 4 is a restriction map of pHPIV-3 showing the location of the leader sequence and the 
protein encoding regions of HPIV-3. 

Figure 5 is a schematic depiction of the full-length infectious clone, pHPIV-3. VV(|). vaccinia 
1 0 virus polymerase stop signal (TTTTTNT); T7, T7 RNA polymerase promoter; le, HPlV-3 leader sequence; 
NP, P, M, F, HN and L are the HPIV-3 protein coding regions; tr, HPIV-3 trailer sequence; Rz, the 
hepatitis delta virus antigenomic ribozyme; T7<(), T7 RNA polymerase terminator signal. Regions 
containing substitution mutations are expanded and shown above with the specific changes indicated. The 
A to G change at viral base 94 creates a Sad site and the A to G change at viral base 15389 creates a Swl 
1 5 site. 



DETAILED DESCRIPTION OF THE INVENTION 
In accordance with the present invention a system for generating recombinant, human 
parainfluenza virus is provided. In a preferred embodiment, the system is used to generate recombinant 
20 HPlV-3. The system comprises a clone comprising a nucleotide sequence, preferably a double-stranded 
DNA sequence, which encodes a full-length, positive sense anti-genome of HPIV hereinafter referred to as 
the "HPIV clone", and one or more support clones which comprise nucleotide sequences that encode an 
HPIV P protein and an HPIV L protein. The nucleotide sequences that encode the HPIV P protein and 
HPIV L protein may be within the same clone. However, for ease of manipulation, it is preferred that the 
25 nucleotide sequences that encode the HPIV P protein and the HPIV L protein be on separate clones. 
Preferably the HPIV clone comprises a sequence encoding an HPIV-3 antigenome. Preferably the support 
clone or clones encode a P protein and an L protein of HPIV-3. In another embodiment the system further 
comprises a support clone which comprises a nucleotide sequence that encodes the HPIV NP protein, 
preferably the HPIV-3 NP protein. 
30 As used herein "clone" refers to double-stranded DNA that can be introduced into a cell and 

expressed. The clone may be in the form of a viral vector such as, for example, a vaccinia viral vector, or, 
preferably, in the form of a plasmid. Preferably, the HPIV clone and the support clones each comprise an 
RNA polymerase promoter, more preferably a T7 RNA polymerase promoter. Each of the RNA 
polymerase promoters is operatively linked to the corresponding HPIV encoding sequence in the clone, 
35 Thus, the RNA polymerase promoter on the HPIV clone is operatively linked to the HPIV sequence and 
the RNA polymerase on the support clones are operatively linked to the sequence or sequences encoding 



3 
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the respective HPIV protein. Preferably, the plasmids comprising the clone also comprise an origin of 
replication, particularly a bacterial origin of replication. 

The present invention also provides a clone which comprises a nucleotide sequence encoding the 
anti-genomic sequence of HPIV-3, hereinafter referred to as the *'HPIV-3 clone". Preferably, the HPIV-3 
5 clone encodes a full-length antigenomic sequence of HPIV-3. As used herein "full-length" means that the 
anti-genomic sequence is complementary to the entire negative sense, genomic sequence of HPIV-3 
extending from the 3' nucleotide of the leader sequence through the 5'nucleotide of the trailer sequence of 
the HPIV-3 genome. The DNA form of the full-length, genomic sequence of HPIV-3, SEQ ID NO: I, is 
shown in Fig. 1. In addition to the leader and trailer sequences, the HPIV-3 clone contains sequences 
10 encoding the HPIV-3 proteins N, P, M. F, KN, and L, as well as the cis-acting elements. The HPIV-3 
clone may encode a wild-type HPIV-3 antigenome sequence or a modified HPIV-3 antigenome having one 
or more mutations contained therein. The mutation may be in the form of a foreign gene which is inserted 
into the HPIV-3 antigenome-encoding sequence. Preferably, the mutations are substitutions of one or more 
nucleotides, deletions of 6 to 12 nucleotides, or additions of 6 to 12 nucleotides in the HPIV-3 
1 5 antigenome-encoding sequence. More preferably, the modified HPIV-3 clone contains substitutions either 
in the genes or the cis-acting elements, or both of the HPIV-3 antigenome-encoding sequence. 

Preferably, the HPIV-3 clone is a plasmid that comprises a nucleotide sequence which encodes a 
ribozyme, more preferably the hepatitis delta virus antigenomic ribozyme, immediately downstream from 
the HPIV antigenome-encoding sequence. Following transcription of the clone, the ribozyme cleaves the 
20 riboz>'me from the HPIV antigenome to provide a replication competent 3' end on the antigenome. More 
preferably, the HPIV-3 clone also comprises an RNA polymerase terminator following the ribozyme 
sequence. In one embodiment, the HPIV-3 clone is the plasmid pHPIV-3 depicted in Fig. 5, which 
plasmid was deposited with the American Type Culture Collection, 12301 Parklawn Drive, Rockville, MD 

20852, USA, on May , 1 998 and has Accession Number . 

25 The present invention also relates to a method of preparing recombinant HPIV, particularly 

HPIV-3, using the above described system. The method comprises introducing an HPIV clone and the 
suppon clones which encode HPIV P protein and HPIV L protein, into host cells, preferably human cells; 
culturing the host cells under conditions that allow for formation of an HPIV anti-genomic transcript, 
synthesis of the HPIV genome (vRNA) and the HPIV proteins L, P, and NP, and formation of a 
30 recombinant HPIV; and recovering the recombinant HPIV from the culture. Preferably the host cells which 
are transfected with the HPIV clone and support clones, contain an RNA polymerase that corresponds to 
the RNA polymerase promoter that is operatively linked to the HPIV sequences in the HPIV clone and the 
support clones. In a preferred embodiment, a support clone comprising the nucleotide sequence which 
encodes the HPIV-NP protein operatively linked to an RNA polymerase promoter is also introduced into 
35 the cells. Preferably, the host cells are infected with a viral recombinant, preferably a vaccinia virus 
recombinant, which expresses the RNA polymerase, more preferably the T7 RNA polymerase, prior to or 
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in combination with transfection with the HPIV clone and support clone or clones. When such cells are 
infected with the vaccinia virus recombinant, it is preferred that the HPIV-3 clone also comprise a vaccinia 
virus RNA polymerase temiinator upstream of the T7 RMA polymerase and a vaccinia virus RNA 
polymerase terminator downstream of the T7 RNA polymerase terminator. 
5 The present invention also relates to a method of introducing site-specific mutations into the 

genome of a recombinant HPIV-3. The method comprises preparing a a modified HPIV-3 clone 
comprising one or more mutations in the sequence which encodes the HPIV-3 anti-genome; introducing 
the modified HPIV-3 clone and support clones comprising sequences which encode HPIV-3 P protein and 
HPIV-3 L protein, and preferably, HPIV-3 NP protein into host cells; and culturing the host cells under 

10 conditions that allow for formation of a modified HPIV-3 antigenomic transcript and synthesis of the 
HPIV-3 L, P, and NP proteins. The modified HPIV-3 clone and the support clones contain an RNA 
polymerase promoter that is operatively linked to the HPIV-3 protein-encoding sequences. The host cells 
contain within the cytoplasm thereof an RNA polymerase that corresponds to the RNA polymerase 
promoter on the modified HPIV-3 clone and the support clones. 

15 Preferably, the modified HPIV-3 clone, containing one or more mutations therein, is made by 

conventional PGR techniques using an HPIV-3 clone as a template. The mutations are made in the cis- 
acting elements of the HPIV-3 sequence or in an HPIV-3 protein encoding sequence. Preferably the 
mutation is made in the L protein-encoding sequence. If mutations are made in the HPIV-3 protein- 
encoding sequences of the HPIV-3 clone, it is preferred that a similar type of mutation be made in the same 

20 site in the protein encoding sequence of the corresponding support clone. For example, if a mutation is 
made at a specific site in the L protein-encoding sequence of the HPIV-3 clone, it is preferred that the same 
mutation be made at the same site in the L protein-encoding sequence of tlie L protein-encoding support 
clone. Such method is useful for identifying mutations that block the synthesis of viral particles or result in 
the production of non-infectious or non-virulent HPIV-3. 

25 To determine whether the mutated viruses produced by the above-described method are non- 

virulent, i.e., attenuated, the mutated viruses are first tested in vitro to determine whether the mutation has 
resulted in a slower growing phenotype, i.e., the mutated virus grows more slowly in tissue culture than the 
wild-type virus. The mutated viruses which exhibit this phenotype are then examined in vivo , by injection 
into an animal, such as the cotton rat, which is good experimental model for parainfluenza virus. The 

30 infected animals are then examined to determine if they are producing antibodies to HPlV-3 and to 
determine if there is a reduction in the severity of symptoms as compared to animals infected with wild- 
type virus. 

The ability to produce recombinant HPIV-3 virus genetically engineered to contain specific 
alterations within the HPIV-3 genes and cis-acting elements expedites the study of all aspects of the virus 
35 replication cycle. Additionally, a system which permits production of recombinant HPIV that is 
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I genetically engineered to contain specific alterations within the HPIV-3 genes is useful for identifying 

attenuating parainfluenza genotypes and for developing a live vaccine for human parainfluenza virus. 

The following examples of methods of preparing a fuil-length cDNA clone of HPIV-3 and 
methods of preparing a modified or mutated, infectious, recombinant HPIV -3 are for purposes of 
5 illustration and are not intended to limit the scope of the invention. 
Example 1. Construction of a Full-Length cDNA Clone of HPIV-3. 

The construction of a full-length infectious clone of HPIV-3 containing mutations at specific sites 
was achieved by a two-step process. The initial step was the generation of a minireplicon which contained 
the positive sense leader portion region and trailer regions of HPIV-3. The second step involved the 
10 insertion of RT-PCR fragments derived from HPIV-3 genomic UNA into the minireplicon.The positive- 
sense minireplicon contained the following: A T7 promoter which directed the synthesis of two non-viral G 
residues, followed by the positive-sense leader region of HPIV-3, a portion of the NP 5' UTR (to viral base 
97), the luciferase gene, a portion of the L 3*UTR (starting at viral base 15387) and trailer sequences of 
HPIV-3. The full-length nucleotide sequence of HPlV-3 genome and the location of the leader sequence, 
15 trailer sequence and protein-encoding regions is shown in Figure 1. The hepatitis delta virus antigenomic 
ribozyme followed to effect precise cleavage after the 3 terminal HPIV-3 specific base. A T7 RNA 
polymerase terminator was also incorporated into the replicon followed the ribozyme sequence. 
Additionally, vaccinia virus polymerase termination signals were inserted immediately upstream and 
downstream of the aforementioned sequences. During the construction, single base changes were created 
20 in the regions encoding the NP 5' UTR and the L 3 UTR. An A to G change at viral base 94 and the A to 
G change at base 15389 created Sad and Stul sites, respectively, which served as genetic tags to identify 
virus as being of recombinant origin. 

The vector pOCUS-2 (Novagen) was chosen as the starting plasmid for preparing the mini- 
replicon [pPIV3-MG(+)], because of its small size (1930 bp). It is believed that the use of a small starting 
25 plasmid may increase the stability of the full-length clone. 

The mini-replicon was constructed by generating PGR products encoding the leader and trailer 
regions flanked by a T7 promoter and hepatitis delta virus antigenomic ribozyme, respectively. The 
primers used for synthesis of the T7 promoter/leader region were: 5'- 
TAGTCGGCCC TAATACGACTCACTATAGG ACCAAACAAGAGAAGAA 
30 ACT-3\ SEQ ID N0:2, and 5^GAAATTATAGAGCTCCCTTTTCT-3', SEQ ID N0:3. The first primer 
encodes an EagI site and the T7 promoter (underlined) and the second primer introduced an A to G base 
change at viral base 94, (bold) within the 5' untranslated region (UTR) of the NP mRNA, which creates a 
Sad site. The template for this reaction pHPIV3-CAT, described in De, B. P. and A. K. Banerjee. (1993.) 
Rescue of synthetic analogs of genome RNA of human parainfluenza virus type 3. Vir. 196:344-348, 
35 which is incorporated herein by reference. The resulting PGR product was cloned into the Eagl and Sad 
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sites of pOCUS-2, which is depicted in Figure 2, The primers used for synthesis of the trailer/ribozyme 
region were: 

5^-TAAGGCCTAAAGATAGACAAAAAGTAAGAAAAACATGTAATATATATA 
TACCAAACAGAGTTCTTCTCTTGTTTGGTGGGTCGGCATGGCATCTC-3\ 
5 SEQ ID N0:4, and 5'-CTGGGTAC CTCCCTTAGCCATCCGAGT -3\ SEQ ID N0:5. The first primer 
contains sequence from the 3* UTR of the L mRNA, through the trailer, and primes synthesis of the 
ribozyme (underlined). Also, an A to G change at viral base 15389 (bold), which creates a StuI site within 
the 3'UTR of the L mRNA is encoded by this primer. The second primer encodes the 3' end of the 
ribozyme (underlined) and a Bglll site. The template for this PGR reaction was pSAl, a plasmid 

10 containing the ribozyme sequence, as previously described in Perrotta, A. T. and M. D. Been. 1991. The 
pseudoknot-like structure required for efficient self-cleavage of hepatitis delta virus RNA. "Nature 
350:434-436, which is incorporated herein by reference. The PGR product derived from this reaction was 
cloned into the StuI and Bglll sites of pOCUS-2. The leader and trailer regions were combined into a 
single clone by transferring the Eagl/PstI fragment of the T7/leader clone into the PacI/PstI sites of the 

1 5 trailer/ribozyme clone. 

To prevent possible interference by transcription from cryptic vaccinia virus promoters, vaccinia 
virus polymerase transcription stop signals (TTTTTNT) were inserted upstream and downstream of the 
replicon near PvuII and Sspl sites within pOCUS-2. A T7 transcription termination signal was removed 
from pET-17b by digesting with Blpl and BspEI, and inserted into the Sspl site (blunted with T4 DNA 

20 polymerase) of pOCUS-2. A luciferase reporter gene was then inserted into the Sad and StuI sites to 
create pPIV3-MG(+), which is schematically depicted in Figure 3. 

To generate the full-length HPIV-3 clone, five RT-PCR products were generated from HPIV-3 
virion RNA and cloned. These fragments were subsequently inserted into pPIV3-MG(+), replacing the 
luciferase coding sequences, to create pHPIV-3. the full-length clone. The five RT-PCR products were 
25 generated from HPIV-3 strain 47885 virion RNA which was obtained from Robert Chanock, at the 
National Institues of Health. These PGR products, encompassing the remainder of the HPIV-3 genome, 
were identified by restriction enzyme analysis and cloned, either in pUC19 or pOCUS-2, and then inserted 
into pPIV3-MG(+). 

The first PGR product containing viral bases 83 to 2721 was inserted into the Smal site of pUC19. 

30 The 83/2721 clone was then digested with Sad and XmnI, removing viral bases 94 to 553 which were 
inserted into the Sad and SphI (blunt with T4 DNA polymerase) of pPIV3-MG(+). The 83/2721 clone 
was then digested with PstI to remove a fragment containing viral bases 540 to 2274, which was then 
inserted into the PstI site of the pPIV3-MG(+) clone containing the 94/554 fragment. The second PGR 
product encompassing viral bases 13395 to 15397 was cloned into the Smal site of pUCI9. This 

35 13395/15397 clone was then digested with StuI and Pad and the resulting fragment containing viral bases 
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13632 to 15381 was inserted into the StuI and Pad sites of the pPlV3-MG(+) clone containing viral 
sequence to base 2274. The resulting clone contained viral bases 1 to 2274 and 13632 to 15462 in the 
pPIV3-MG(+) context. 

The third PGR product containing viral bases 7403 to 11513 was digested with BspMI (blunted 
5 with T4 DNA polymerase) and Xhol to produce a fragment containing bases 7437 to 11444 which was 
inserted into the Xhol and Sspl sites of pOCUS-2, The fourth PGR fragment containing viral bases 10904 
to 13773 was digested with PvuII and BamHI (viral bases 10918 to 13733) and inserted into the EcoRJ 
(blunted with T4 DNA polymerase) and BamHI sites of pUG19. The two viral segments were combined 
by digesting the 7437/1 1444 clone with Sad (blunted with T4 DNA polymerase) and EcoNI and inserting 

10 into the 10918/13733 clone digested with EcoRJ (blunted with T4 DNA polymerase) and EcoNl. The 
resulting clone contained viral bases 7437 to 13733 in a pUG19 background. The remainder of the viral 
sequence was derived from a fifth PGR product encompassing viral bases 83 to 7457 which had been 
digested with XmnI and Xhol (viral bases 553 to 7437) and cloned into the StuI and Xhol sites of pOCUS- 
2. The 7437/13733 clone was then digested with BamHI, blunted with T4 DNA polymerase, and digested 

15 with Xhol to release a fragment that was inserted into the EagI (blunted with T4 DNA polymerase) and 
Xhol digested 553/7437 clone. The resuhing clone contained viral bases 553 to 13733. This clone was 
then digested with PshAl and Pad and the resulting fragment containing viral bases 2143 to 13632 was 
inserted into the same sites of the pPIV3-MG(+) clone containing viral bases 1 to 2274 and 13632 to 
15462. This generated pHPIV-3, the infectious clone, which is schematically depicted in Figure 4. 

20 To insert the P gene into pGEM-4, P sequences were transferred from a P-lac-fusion clone, 

(described in 38, which is incorporated herein by reference) by digesting with Xbal (blunted with T4 DNA 
polymerase) BamHI, and inserted into the Kpnl (blunted with T4 DNA polymerase) and BamHI sites of 
pGEM4. The pPIV3-NP and pPlV3-L clones, as described in 15, 16, which are incorporated herein by 
reference., were in a pGEM-4 background. The pPIV3-L clone was also modified. In the natural L mRNA 

25 sequence there is a non-initiating AUG 1 1 nucleotides from the 5' end of the transcript. This was removed 
from pPIV3-L by mutational PGR, changing viral bases 8636 and 8637 from AT to TA. 

Example 2. Preparation of Recombinant HPIV-3. 

Gonfluent monolayers of HeLa cells in 6-well plates were infected with recombinant vaccinia 

30 virus VTF7-3 at a multiplicity of infection of 2. vTF7-3 expresses T7 RNA polymerase as described in. 
Fuerst, T. R., P. L. Earl, and B. Moss. 1987. **Use of a hybrid vaccinia virus-T7 RNA polymerase system 
for expression of target genes." Mol. Gell. Biol. 7:2538-2544, and Fuerst, T. R., E. G. Niles, F. W. 
Studier, and B. Moss. 1986. "Eukaryotic transient-expression system based on recombinant vaccinia virus 
that synthesizes bacteriophage T7 RNA polymerase." Proc. Natl. Acad. Sci. 83:8122-8126, which are 

35 incorporated herein by reference. After 1 hour at 37°G, pPIV3-NP, pPIV3-P. pPIV3-L and pHPIV-3 were 
transfected using Lipofectin (BRL) according to manufacturers instructions. After three hours the 
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transfection medium was removed and replaced with 1.5 ml Dulbecco's modified Eagle's medium 
(DMEM)/5% fetal bovine serum. After 40 to 48 hours the plates were frozen, thawed and scraped. The 
clarified medium supernatant (250 ^1) was then used to infect fresh HeLa cell monolayers in 6-well plates. 
DMEM (L5ml) containing 25 |ig/ml l-B-D-arabinofuranosylcytosine (araC) to inhibit vaccinia virus 
5 replication was added after a 1 hour attachment. After forty hours the plates were frozen, thawed, and 
scraped. The clarified medium supernatant was then titered for HPIV-3 in the presence of AraC. During 
the titering, isolated plaques were picked as agar plugs. The agar plugs were placed in 500 ml opti-MEM 
at 4°C for 4 hr. 250 )il were then used to infect fresh HeLa cell monolayers for amplification of the plaque 
isolates for 40 hr. 

10 In a preferred embodiment transfection conditions were 1 |ig pHPIV-3, 2 ^ig pPIV3-NP, 4 \ig 

pPlV3-P and 0.1 pPIV3-L. Under these condhions approximately 1000 pfu per 6X10^ cells were 
obtained during the initial transfection and 10^ pfu per 6x10^ cells after the amplification. 

When individual plasmids were omitted from the transfection step it was observed that the 
pHPIV-3, pPIV3-P and pPIV3-L plasmids were required for recovery of virus but, surprisingly, as shown 
1 5 in Table 1 below, pPlV3-NP was not. 



20 



Table 1 



Plasmids Transfected 
HPIV-3 
+ 

+ 



NP 

+ 
+ 

+ 
+ 



P 
+ 
+ 



L 
+ 

+ 



Virus Recovery 
14/15' 
0/2 
3/3'' 
0/3 
0/2 



25 



Table 1. Recovery of HPIV-3 from pHPIV-3. HeLa cell monolayers were infected with vTF7-3 and 
transfected with the indicated plasmids. After 40 hr cells were lysed and supematants added to fresh HeLa 
cell monolayers in the presence of araC to inhibit vTF7-3 replication. These monolayers were then lysed 
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and the supematants assayed for HPlV-3. The number of experiments for which HPIV-3 was recovered 
per attempts is displayed under virus recovery, 

'^The single experiment that did not yield HPIV-3 used O.I ug of the P plasmid. 
®The omission of the NP plasmid resulted in 3 to 5 fold lower titers of HPIV-3. 

5 

Characterization of Recovered Virus. 

In order to characterize the recovered virus and to purify HPIV-3 from vTF7-3, plaques of which 
are only slightly smaller than those of HPIV-3, isolated plaques suspected to be HPIV-3 were picked and 
amplified in HeLa cells. The plaque purified and amplified virus isolates and appropriate controls were 
10 then analyzed in neutralization assays. Virus (isolates #3 and #5) was preincubated (30 min on ice) with 5 
ul rabbit pre-immune serum, 5 ul rabbit polyclonal anti-HPIV-3 antisera, or assayed in the presence of 
25ug/ml araC The sera was incubated with virus on ice for 30 min prior to a standard plaque assay. To 
allow maximal plaque development, the plates were then incubated at 37°C for 66 hr prior to staining with 
crystal violet. The results of the assay indicated that the plaque purified virus was completely inhibited by 
15 the anti-HPIV-3 antisera, while vTF7-3 was not. In contrast, the HPIV-3 isolates were not inhibited by 
AraC, whereas the vTF7-3 virus was completely inhibited. Interestingly, of the eight recombinant HPIV-3 
isolates, four had plaque sizes identical to the parental HPIV-3 stock while four were slightly larger. The 
plaque size of isolate #3 was slightly larger than isolate US and the wild type HPIV-3 virus. 

To determine whether the NP coding sequence of pHPIV-3, which shares the same position as 
20 iuciferase in pPIV3-MG(4-), was being expressed from pHPIV-3, pHPIV-3 and pPIV3-NP were separately 
transfected into vTF7-3 infected HeLa cells and cell lists prepared after 48 hr. The lysates were then 
analyzed by Western blotting using an anit-HPIV-3RNP antisera. This antisera recognizes primarily NP 
and reacts poorly with P. 

Specifically, extracts (equivalent to 6x10"* cells) were run on SDS-10% PAGE and transferred to 
25 nitrocellulose membranes. The primary antibody was a rabbit polyclonal anti-RNP antisera diluted 1 : 1 000. 
The secondary antibody was 1:1000 dilution of a goat anti-rabbit antibody conjugated to horseradish 
peroxidase. Visualization was through chemiluminescence (ECL kit, Amersham). As shown by the 
Western blot, HPIV-3 RNP recognized NP from the pPIV3-NP and pHPIV-3 transfected cell extracts and 
from purified HPIV-3 RNP. No proteins were recognized in a mock-transfected HeLa extract. Thus, it 
30 appears that NP is expressed from pHPIV-3, presumably being translated from the T7-directed, 
antigenomic RNA transcript. 

To determine whether the recovered, recombinant virus had specific mutations in its genome, 
RNA was extracted from wild type and plaque isolated viruses and used for RT-PCR analysis using 
primers fianking the substitution mutations. Viral RNA was isolated from approximately 2x10' plaque 
35 forming units (pfti) of plaque purified virus isolates #3, 5, 7 and 9. or wt HPIV-3 strain 47885/, Reverse 
transcription was carried out using Superscript II reverse transcriptase (BMB) at 44°C for I hr using 
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oligonucleotides which primed at viral base 23 or 15100. The PGR was carried out with Expand Long 
polymerase (BMB) using second primers which result in amplification of viral bases 23 to 303, or 15 100 to 
15440, 

PGR products encompassing viral bases 1 to 324 and 15080 to 15462 were generated from the 
5 indicated isolates, digested with Sacl and StuI, respectively, and analyzed on a 1.4% agarose gel.. PGR 
products of the expected sizes were generated in a RT-dependent manner, indicating that the PGR products 
were derived from RNA rather than contaminating plasmid DNA. 

As shown on the agarose gel, the sizes of the 1 to 324 and 15080 to 15462 PGR products are 
increased by 21 and 22 base pairs, respectively, over the length of the viral specific regions due to the 
10 inclusion of restriction enzyme sites in the amplification primers. Digestion with Sad showed that the 
mutation at base 94 was not present in the wild-type virus but was present in the plaque isolated viruses, 
indicating they are of recombinant origin. Similarly, PGR product derived from the region encompassing 
viral base 15389 of wild type HPlV-3 was not cleaved by Stul. However, only four of the eight plaque 
isolated viruses contained the mutation which creates the Stul site. Direct sequencing of the PGR products 
1 5 confirmed these results. 

DISCUSSION 

A full-length plasmid clone of the HPlV-3 genome, pHPIV-3 was constructed.. Upon transfection 
of pHPIV-3 and plasmids encoding the viral NP, P and L proteins into vTF7-3-infected HeLa cells, 
recombinant HPlV-3 bearing genetic markers was efficiently recovered. Several interesting feanjres of this 

20 system were noted. First, the viral NP protein could be expressed from the infectious clone, and this 
expression obviated the need for an NP support plasmid. It is believed that the NP protein is synthesized 
directly from the primary anitgenomic transcript after it is capped by the vaccinia virus capping enzyme. 

Second, two recombinant viruses with distinct genotypes and phenotypes were produced, 
probably due to recombination between the pHPIV-3 and pPIV3-L plasmids, although the possibility that 

25 the reversion arises during RNA replication cannot be excluded. pPIV3-L contains the entire L 3* UTR and 
part of the trailer region, extending to base 15437, an overlap of 48 base pairs beyond the Stul site. This is 
ample room for recombination between plasmids to occur readily in vaccinia virus infected cells 

From these results, there appears to be selection in the HPIV-3 system. All the large plaque virus 
isolates had reverted to a wild-type sequence at base 15389, while retaining the change at base 94, Since 

30 A94G is the only known alteration in these viruses from the parental virus, it appears that The isolates 
which retained both mutations had a plaque size identical to that of the parental (wild type) virus, but when 
the 15389 mutations was lost, plaque size increased, indicatmg that the mutation at base 15389 was 
detrimental in the context of the A94G mutation. There was one other known change between pHPIV-3 
and the support plasmids, A non-initiating AUG exists in the natural L protein message. Since this AUG 

35 is only 1 1 nucleotides from the 5' end of the L mRNA and in a poor translation initiation context, it may 
not cause much interference with L mRNA translation. However, in the support plasmid pPlV3-L this 
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non-initiating AUG is much further from the 5' end of the transcript where it is more likely to be 
recognized by ribosomes. This AUG was removed from pPIV3-L by changing bases 8636 and 8637 from 
AT to TA, destroying a SphI site and creating a Nhel site. To investigate whether this change was present 
in the recombinant virus and could be responsible for the large plaque phenotype, RT-PCR analysis was 
5 done. PGR products encompassing this site and derived from both the wild type and the plaque isolated 
viruses retained a wild type sequence, indicating recombination had not occurred over this region and that 
these changes could not account for any variance in plaque size. 

The finding that recombination may occur between transfected plasmids indicates that care must 
be taken when introducing mutations into the paramyxovirus or rhabdovirus infectious clone systems. 

10 Mutations introduced within the NP, P or L sequences preferably are carried by both the support plasmids 
and the infectious clone. Otherwise, resultant virus may not carry the desired mutation. The only possible 
exception to this is the HPIV-3 system, in which the HPIV-3 infectious clone expresses NP, negating the 
need for the NP support plasmid. Still, it is preferred that the HPIV-3 NP support plasmid be included in 
the system, since significantly greater yields of HPIV-3 were obtained when the support NP piasmid was 

1 5 included in the transfection. 

An infectious clone for HPlV-3 is useful for understanding the molecular biology of HPIV-3 and 
for developing a vaccine for this important pathogen. The ability to generate specific mutations within 
HPIV-3 makes all aspects of HPIV-3 replication amenable to study. Any mutation, including those studied 
previously in other contexts, can now be examined with this system. The ability to introduce specific 

20 mutations also permits the possibility of revertant analysis, which could refine our understanding of 
protein-protein or protein-RNA interactions. 

The infectious clone is also useful for identifying mutations which attenuate the virus. Such virus 
is useful for developing new vaccine strains of HPIV-3. In addition, mutations present in a current 
candidate vaccine strain of HPIV-3 can be inserted into pHPIV-3. Through identifying multiple deleterious 

25 mutations, it should be possible to engineer several mutations affecting various steps in the virus life cycle 
into a single HPIV-3 strain. Such a virus should be highly attenuated and not readily able to revert. 

While the invention has been described to some degree of particularity, various adaptations and 
modifications can be made without departing fi-om the scope of the invention as defined in the appended 
claims 
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CLAIMS 

What is claimed is: 

1 . A recombinant HPIV clone comprising: 

a. a nucleotide sequence encoding a positive sense, antigenome of human 
5 parainfluenza virus; and 

b. an RMA polymerase promoter operatively linked to said nucleotide sequence. 

2. The clone of claim 1 wherein the nucleotide sequence encodes the antigenome of HPlV-3. 

10 3. The clone of claim 2 wherein said clone further comprises a ribozyme sequence downstream from said 
antigenome-encoding sequence, 

4. The clone of claim 2 wherein the RNA polymerase promoter is the T7 RNA polymerase promoter. 

15 5. The clone of claim 3 wherein the ribozyme is an antigenomic ribozyme. 

6. The clone of claim 3 further comprising an RNA polymerase terminator downstream of the ribozyme 
sequence. 

20 7. The clone of claim 2 wherein the nucleotide sequence encodes a modified anti-genome of HPIV-3. 

8. The clone of claim 2 wherein the clone has the characteristics of a plasmid deposited with the 
American Type Culture Collection and having Accession Number - 

25 9. The clone of claim 7 wherein the HPIV-3 antigenome-encoding sequence comprises a mutation selected 
from the group consisting of a substitution of one or more nucleotides, a deletion of from 3 to 12 
nucleotides, and an addition of from 3 to 12 nucleotides. 

10. A method for preparing recombinant human parainfluenza virus comprising; 
30 a. providing a recombinant system which comprises: 

i. an HPIV clone comprising a nucleotide sequence encoding a positive 

sense, antigenome of human parainfluenza virus; 

ii a support clone comprising a nucleotide sequence encoding a human 

parainfluenza virus L protein; and 
35 iii. a support clone comprising a nucleotide sequence encoding a human 

parainfluenza virus P protein, 
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wherein the nucleotide sequence encoding the P protein and L 
protein may be on the same support clone or on separate support 

clones; 

b. introducing the recombinant system of step (a) into host cells; 
5 c. culturing the host cells of step (b) for a time sufficient to permit 

transfection of the host cells and formation 

of recombinant human parainfluenza virus; and 
d. recovering the recombinant human parainfluenza virus from the culture of 
transfected host cells. 

10 

1 1. The method of claim 1 I wherein the antigenome-encoding sequence of the HPIV clone is 
operatively-Iinked to an RNA polymerase promoter; 

wherein the P protein-encoding sequences of the support clones is operatively-Iinked to an RNA 
polymerase promoter ; 

1 5 wherein the L protein-encoding sequences of the support clones is operatively-Iinked to an RNA 

polymerase promoter; and 

wherein the host cells comprise an RNA polymerase corresponding to 
the RNA polymerase promoter of said HPIV clone and said support clones. 

20 12. The method of claim 1 1, wherein the host cells are infected with a viral recombinant capable of 

expressing the RNA polymerase prior to or in combination with introduction of the recombinant system 
into the host cells. 

13. A host cell for producing a recombinant human parainfluenza virus, said host cell comprising: 
a. an HPIV clone comprising a nucleotide sequence encoding a positive 
sense, antigenome of human parainfluenza virus; 
b a support clone comprising a nucleotide sequence encoding a human 
parainfluenza virus L protein; and 

c. a support clone comprising a nucleotide sequence encoding a human 
parainfluenza virus P protein; 

wherein the nucleotide sequence encoding the P protein and L 
protein may be on the same support clone or on separate support 

clones. 

35 14. The host cell of claim 13 further comprising a support clone comprising a nucleotide sequence 
encoding a human parainfluenza virus NP protein. 
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15. The host cell of claim 13 wherein the HPIV clone comprises a nucleotide sequence encoding an anti- 
genome ofHPIV-3. 

16. The host cell of claim 15 wherein the anti-genome encoding sequence of HPIV-3 comprises a site- 
specific mutation. 

17. The host cell of claim 14 wherein the HPIV-3 clone further comprises an RNA polymerase promoter 
operatively linked to the HPIV-3 antigenomic sequence and wherein each of the support clones comprises 
an IINA polymerase promoter operatively linked to the HPIV-3 protein-encoding sequence of said support 
clone. 

1 8. A method of introducing a site-specific mutation into the genome of a recombinant human 
parainfluenza virus, comprising the following steps: 

a. preparing a clone comprising a nucleotide sequence encoding a human 
parainfluenza viral antigenome having a mutation at a specifc site; 

b. co-transfecting host cells with the clone of step (a), 

a support clone comprising a nucleotide sequence encoding an HPIV L 
protein, and a support clone comprising a nucleotide sequence encoding an HPIV P 

protein; 

wherein the nucleotide sequence encoding the P protein and L 
protein may be on the same support clone or on separate support 
clones; and 

c. culturing the transfected host cells for a time sufficient to allow formation of a 
recombinant human parainfluenza virus. 

19. The method of claim 18 further comprising the step of transfecting the host cells with a support clone 
comprising a nucleotide sequence encoding an HPIV NP protein. 

20. The method of claim 18 wherein the clone of step (a) is prepared using polymerase chase reaction 

techniques and a clone comprising a nucleotide sequence encoding a human parainfluenza virus 
antigenome as a template, wherein the antigenome-encoding nucleotide sequence of said template clone 

lacks the site-specific mutation.. 
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1613 1661 1674 

1663 

Plel Cqc81 

NP Gene End P Gene Star t I | 

atcaat |aatQaatQQqaaaQoj ctt |QqqQttQQaqj iQtcctatcQtQccaqaQCQtaqQqtqqtaaatttaqaqtctqctt 1760 



tQgttQttQtttattctttttgQQtcctQQtttcttaggQtQgtQtggtcttgtotctcQCCQtttQQQtctcQgQcgQa 

■ I • • • • • I I a 

1751 1757 
HboII 

SfoNI Ksp632I 
I I 

gcQQctcQQtcQQtQgagQgttc^t^QQQgcgQtgctQQQQQctQtcQQQtcQtggQttcttgggQQgQggQQCCQQgQ 1840 
cgttgQgttQgttQtctctcQQCtQcctttcgctQcgQtttttgatQgtttQgtQcctQQgQQCccttctccttggttct 

1793 1826 

1826 
MboII 

BsqRI EcoRI BbQl 

I I I 

gatQQQtcQQctaatQtctcctcggccctcQQCQtcQttgQQttcQtQctcQgcQccgQcccccaagQQgacctatcggQ 1920 
ctQtttQgttgattQtQgQggagccgggQgttgtQgtQQcttQQgtatgagtcgtggctgggggttcttctggQtQgcct 

1858 1880 1907 

1907 

Hoell Hindi! 



I I 

QaacgQCQCQQtcQQCQcaQgQQCccagcQQCtcQgcgccaccatctgtcQQCCQgQQQtcQQQCCQQcagQQQCQQgtg 
tttgctgtgttQgttgtgttcttgggtcgttgagtcgcggtggtQgQCQgttggtctttQgtttggttgtctttgttcQC 



2000 



I • I 

1955 1968 

HslI B5pl407I Sspl 



iQQQQgttQgtggQtcQQctgQCQQQQQtQgacQgtctgggtcQtcacQcgQQtgtQCQQCQgQQgcQQQogQtQgQQQt 2080 
tttttcQQtcQcctagttgQctgtttttQtctgtcQgttcccagtQgtgtgcttQcatgt tgtct tcgttttctQtctt to 



aaa 

ti _ 

I 

2046 12054 2078 
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Gsul 
EcoBII 

Hbvl P5hAI BsbAI 

Bell Bspl407I MboII BsmAI BsAI 

I I II II 11 I 

QttgQtcQggQQQCtgtQCQgggQggQtctgggQgQQgQQgcQgctcQgQtQgtQgagctgQgQCtgtggtctctggQgg 2160 
taQctQgtcctttgQCQtgtccctcctQgQccctcttcttcgtcgogtctQtcQtctcgQCtctgQCQCcagagQCctcc 
I • I • • I •! • •! I II I • 

2083 2095 2115 2141 2149 

2121 2143 2150 

SfoNI Sspl 

I I 

QQ t c t c t ggaagcQ t cQcaga 1 1 c t qqqqq t ggaacccaaQQC qc ggagaa t q 1 1 go t c t cqq t gaaa 1 1 agaaaga t gg 2240 

ttQgQgQccttcgtQgtgtctQQgQtttttQccttgggttttgtgcctcttQtQactagQgttQctttQQtctttctQcc 
• I * • • I ■ • • 

2173 2210 

PlQl Pstl EcoRV 

I I I 

QtQQggQctctQttgQgaggQQQQtgcgQCQQtctgcQgQtgttccQQgcgQgQtQtcQggQQgtgQtgtcQtQtttQCQ 2320 
tQttcctgagQtQQctctccttttQcgctgttQgQcgtctQCQQggttcgctctQtQgtccttcQCtQCQgtQtQQQtgt 

2246 2274 2293 

Hindlll 
Bell HboII 
I I I 

QCQgQQCQQQgtQgQQQCQgtgQtcatggQQgQQgcttggQQCctQtcQgtQCQCCtgQtQCQQgttcQQtgagtgttgt 2400 
tgtcttgtttcQtctttgtcQCtagtQccttcttcgQQCCttggQtQgtcQtgtggQCtQtgttctQgttQctcQCQQCQ 

2341 2349 

2353 

HboII HboII 
Bbvl HboII Fokl HboII HboII 

I II III i 

tQctgctgcgacQccQgQtgQtgaagQQgQQQtQctaatgQaaaQtagtQggatgQQgQQQQgttcttcaQcacQCCQag 2480 
QtgQcgQcgctgtggtctQctQcttcttctttQtgQttactttttotcQtcctQcttcttttcQQgQQgttgtgtggttc 
I . • i I • • -I I • I • I 

2405 2423 2451 2465 2480 

2426 BsrI 2455 

QQgQtgacQQQagQat taaQQQQgggggggggggQQQQgggaQQgoctggt t tQagaaatcQagagatQCtgacaQCcag 2560 
ItctQctgttttcttQatlttttcccccccccccttttccctttctgQCCQQQttctttQgttctctQtgQctgttggtc 

2526 

QCQtCQQCQtCQgQtcaCQQQCCCQCQtCQQQQgggCQQQQgQQQQtCtCQQQQQCQQCQQCCQCCQQCQCCgQCQCQaa 2640 

tgtQgttgtQgtctQgtgtttgggtgtQgttttcccgttttcttttagQgtttttgttgttggcggttgtggctgtgttt 
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ggggCQQQCQgQQQCQCQgQCQgQQtCQtCQgQQQCQCQQtCCCCQtCQtggQQtCCCQttQtcgQCQQCQQCQCtgaCC 2720 

ccccgtttgtctttgtgtctgtcttagtQgtctttgtgttQggggtQgtQccttQgggtQQtagctgttgttgttQctgg 

• •*•••«• 

gQQCCgQQCagaCQQgCQCQQCCCCCCCQQCQQCQQC t CCCaga t CQQC t eg t QCQQQQgaQ t CQQ t CCgQQCQQQC t c t 2800 

cttggcttgtctgttcgtgttgggggggttgttgttgQgggtctQgttgQgcQtgttttcttagttQggcttgtttgagQ 

Hboll 

Mfol K5p632I CIqI 

I I I 

gQQtccQQQCccQQgacQCQQQQgQCQQt tggQQQggQQQggQQggatQCQgQQgQgQgcQQtcgQt t tacagagagggc 2880 
cttaggtttgggttctgtgttttctgttQQcctttcctttccttcctQtgtcttctctcgttagctQQQtgtctctcccg 

2826 2852 2352 

2852 
esaBI 
I 

QQttQctctQttgcQgQQtcttggtgtQQttcQQtctQCQCtQQQQCtQgQtttQtQtcQQgQcaaQcgQgttgtQtgtg 2960 

ttQQtgagQtQQcgtcttQgQQCCQCQttQQgttQgQtgtQgttttgQtctQQQtQtQgttctgtttgctCQQCQtQCQC 

• • t * I • • • 

2930 

SFqNI 
I 

tQgcQQQigtcctQQQCQQtgtQgQtQctgcQtcQQQgatQgQcttcctQgcQggQttaggcQtQggggtttcQQtggQC 3040 
QtcgtttQCQtgQtttgttQCQtctQtgQcgtQgtttctQtctgaaggQtcgtcctQQtcQgtQtccccQQQgttQcctg 

• « I • • • ■ • 

2990 

Vspl 

AsqI Bglll HboII 

I 1 I 

QQtgQCQCQQQQttQQtQCQgQtQCQQQQtgQQQtgttQaQCCtCQQQgcagQtCtQQQgQgQQtggQCgQQtCQCQtQg 3120 

ttQctgtgttttQQttQtgtctQtgttttQctttQCQQtttggQgtttcgtctQgQtttctcttQcctgcttogtgtQtc 

• I • • • • I • • I 

3052 3091 3120 

Bell BsqR! 

I ' I 

QQgQttgQtQgQQQQtCQQagQgQQCQQClgtCQttgQtCQCQtCgttQQtttCQQQtCttQQQQttQtgQCtgagQgQg 3200 

ttctQQctQtcttttagtttctcttgttgacQgtQQctQgtgtQgcQQttQQQgtttQgQattttQQtQctgQCtctctc 

• • «!« • • • |> 

3156 3198 

Bell Mboll 
I I 

gQggQQQgQQQgQCCQQQQtgQQtCCQQtgagQgQgtQtCtQtgQtCQQgQCaQQQttgQQQgQQgQQQQgQtCQQgQQQ 3280 

ctcctttctttctggttt tact tQggttQctctctCQtQgQtQCtagttctgttttQQCtt let tcttttctogttcttt 

3243 3263 
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Bell HboII 

I I 

gaggQQQgQQQgQCCQQQQtgQQtCCQQtgQgQgQgtQtCtQtgQtCQQgQCQQQQttgQQQgQQgQQQQgQtCQQgQQQ 3280 

ctcctttctttctggttttQcttQggttQCtctctcQtagQtQctQgttctgttttQQCtttcttcttttctQgttcttt 

EcoRII 3243 3253 

SexAI BsqBI Nspl 

II II 
QCCQggtttgQcccQcttQiggQggcQCQQggtQttgQCQQgQQtQtQCCtgQtctQtQicgacatgcQggQQQtQcgtt 3360 
tqgtccQQQCtgggtgQQtQcctccgtgttccQtQQCtgttcttQtQtggactQgQtQtQgctgtQcgtcctttotgcQQ 
II • • • • • I • I • 

3281 3332 3343 

3282 p,,I 

Bcgl 
M 

QgagQQcgQcgtacQQgttQQQtcQgQgQtQttQQgttcQtQCQQcgQgtcQOQtgcaQCQQgQctQQtQctcogQQaQg 3440 
tctcttgttgcQtgttcQQtttQgtctctQtQittcQQgtQtgttgctCQgtttQcgttgttctgQttQtgggtcttttc 

• t * til* • • • 

3406 

Spel 3407 
I 

tgQgCQgtQCQQtgQgQtCQCtQgttgCQgtCQtCQQCQQCagCQQtCtCCCQCQQQgCQgQQQQCQQtCQtQtQtQQQC 3520 

QctcgtcQtgttQctctQgtgQtcQQcgtcQgtQgttgttgtcgitQgagggtgtttcgtcttttgttQgtQtQtQtttg 

• I • • 0 • ■ • 

3450 u , 

HpQl 

Nspl Hindi! 
BsqDI HboII Afllll HboII Hfel „ 

I I I I I I Pstop 

gaac t cqqqcq 1 1 gcQaaag t ga t gaagaag t q t c t goo 1 1 go t ggocQ t g 1 1 cqq t gaago t g 1 1 qqcqq 1 1 qcjtq^q 3600 
cttgQgtttqtQQcgttttcQCtacttcttcQtQgQcttQQctQcctgtQcaogttQcttctQcaQttqttQacgQtttc 

I • I • • I • I'll* 

3530 3545 3567 3578 3589 

3567 3584 
3584 

QtcaQQtQaQQQQQacQQCQCcgaQtQQatQgacQQCQQQCQQCQgtagatcQaaQcctQtcQacQCQCQcaaaatcQag 3680 
tQgtttottttttttgttgtggcttQtttQtctgttctttgttgtCQtctQgttttggatQgttgtgtggtgtttQgttc 

«• •••••• 

Vspl 
AsqI 

P end H start | 

cagagtgQQQCQQt QgQCQtcQQtcoQtQtQc^QQtQQgQMQ^ t t lggqattaaatj aataaat tootcct tgiccQaoQ 3760 
gtctCQCtttgttQtctgtagttagttQtatgtttQttctttttQQQtcctQotttcttQtttQQttQggQQCQggtttt 



Fig. I5f 



3743 
3743 
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PIqI 
I 

tgQgtQtQQctQQCtctgcQQtQtQCQCQttcccggQgtcQtCQttctctgQgQQtggtcQtQtQgQQccQtiQccQCtc 3840 
QctcQtQttgQttgQgQcgttQtQtgtgtQQgggcctCQgtagtQQgQgQctcttQCCQgtQtQtcttggtQQtggtgQg 

3796 , 

PflHI BonHI 

I I 

OQQgtcQQtgQQCQgQgoQQQgcQgtQcctcQCQttQgQgttgccQQQQicggQQQtccQCCQQQQCQtggQtcccggtQ 3920 
tttcQgttQcttgtctcttttcgtcQtggagtgtQQtctcQQcggttttQgcctttQggtggttttgtQcctQgggccQt 

• • • • t • I I * 

H 3^^' 3910 

Fokl MboII Tsp45I 

III r 

ittggQtgtcttcttactcggcttcttcgQgQtggaQcgQQtcQQQgacQQatQcgggQgtgtgQQtgQtctigQCQgtg 4000 
QQQCctQCQgQQgQQtgQgccgQQgQQgctctoccttgcttQgtttctgtttQtgccctcQCQCttQctagQQCtgtcQic 
I |. • I • • • • • I • 

3924 3944 3998 

3929 

Snal 

AvqI Bar I EcoRII 

I I I 

QcccgggttQCQQagtttgtggctctggQtcQttQCCQQtcggQttQgccQQQtQCQCtgggQQtgaccQggQQttQttQ 4080 
tgggcccaQtgtttcQQQCQccgQgQCCtQgtQQtggttagcctQQtcggtttQtgtgQcctttQctggtccttQQtQat 

j» • ♦ ♦ •jo ^« • 

4002 4057 4068 

4002 

BsrI HsII MboII 
I I I 

CQQCtQaQCtggQCQtQgQQgtgQgQQgQQCQgtcaaQgcgQQQgQQatgQttgtttQtQcggtQCQQQQtQtQQQQccD 4160 
gttgatttgacctgtQtcttcQCtcttcttgtcQgtttcgcittctttQctQQCQQQtQtgccQtgttttQtQttttggt 

|*jo|s • e e o e 

4088 4094 4105 

AVQII 

Styl 

Ncol AccI 

Dsal BsrI SfoNl Hfal Oral 

1 I II I II 

gQQctgtQcccQtggtccagtQgQctQQgQQQoggQQtgttgttcgatgccQacaaagttgctcttgctcctcQatgtct 4240 
cttgQCQtgggtQccQggtcQtctgQttctttttccttQCQQCQQgctacggttgttcQQcgQgQQcgQggagttQCQgQ 
I'll! • • I • • I • I • 

4170 4177 4205 4228 4238 

4170 4180 
4170 

4174 
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MboH HfQl Oral 

I I I 

tccQctQgatQggQgcQtQQQQttcQgagtQQtcttcgttQQttgtQcggcQQttggQtcQQtQQccttgtttQQQQttc 4320 
QggtgQtctQtcctcgtQttttQQgtctcQttQgQQgcQQttQQCQtgccgttQQCctQgttQtiggQQCQQQttttQQg 

4273 4291 4311 

BspHI 

BstXI SfaNl Pstl 
II II 

ccQQgtcQQtggcQtcQctQtctctQcccQgcQCQQtQtcQQtcQQtctgcQggtQCQCQtcaQQQCQggggttcagQCt 4400 
ggttcagttQccgtQgtgQtQgogQtgggtcgtgttQtQgttQgttQgQcgtccQtgtgtQgttttgtccccQQgtctgQ 

I • I • • • I 1 
4321 4332 4368 

4370 

Fokl HphI Avail Bell 

II II 
gQttctQQQgggQtQgttcQQQttttggQtgQgQQgggtgQQQQQtcQCtgQQtttCQtggtccQtctcggQttgoitcQQ 4480 
ctQQgatttccctQtcQQgtttQQQQcctQctcttcccQctttttQgtgacttQQQgtQccQggtQgQgcctQQctQgtt 

I . I . . I • I • 

4427 4437 4460 4474 

ScqI 
I 

QQgaQQQgtQggcGgQQtgtQctctgtcgQgtQctgtQQacQgQQQQtcgQgQQQQtgagQttgQtQttttctttgggQt 4550 

ttCtlttCQtCCgtCttQCQtgQgQCQgCtCQtgaCQtttgtCttttQgCtCttttQCtCtQQCtQtQQQQgQQQCCCtQ 

4510 

HslI BsqBI PvuII 

Hboll BsrI NspBII 

till 1 

tQgttggQggaQtcQgtcttCQtgtcQQtgcdactgQtctQtQtcQaQQQCQCtQgcQQCtcagctggtQttCQQQQgg 4640 
otcQQCctccttQgtcQgQagtQCQgttQcgttgQcctQgQtQtQgtttttgtgQtcgttCQgtcgQCCQtQQgttttcc 

* |o| tjli 0 ^1 ^ * 

4577 4593 4623 

4581 4597 4623 

BsqBI 

1 

gQgQtttgttQtcccttQQtggQtctQQQtccacQtctcQQtctQgttQtctgggcttcQtcagtQgagQttQCQQgQgt 4720 
ctctQQacQQtQgggQQttacctQgQtttQggtgtQgagttQgQtcaQtQgQcccgQQgtQgtcQtctctQQtgttctcQ 

4652 

SFqNI 

Fokl EcoRlI 

II I 

ggQtgcQQttttccQQccttctttacctggcgagttcQgQtQctQtcctQQCQttQttgcQQQQggQgttgggQQaQtcQ 4800 
cctQcgttQQQQggttggQQgQQQtgQQccgctcQQgtctatgQtQggattgtQQtQQcgttttcctcQQCCCttttQgt 

4721 4745 

Fig. h 
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Spel 



I Mend 
QQCQQtggQQctQgtQQtctctQttttgQtctggQtQtatctQttQQgccQQQgcQQQtQQgQgQtjQQtcQQaaojctt|Qg 4880 
ttgttQccttgQtcQttQgagatoaQQctagacctatatagataQttcggtttcgtttattctctc ttagttttt gaotc 



4810 

Bbvl HboII 
F start F I I 

gacaQQQgpQgtcQQtQccQQCQQCtQttQgcQgccQCOiCtcgctggQQCQQgQQQgQQgggQtaQQaQQQgtttQQCQg 4960 
ctgttttq ttcQgttatggttgttgQtaatcgtcggtgtgagcgaccttgttctttcttccctatttttttcaaattgtc 



491 : 



4960 



QQgQQQCQQQQQCQQQQQgCQgQgaQCQCCQgQQCQQCQagatCQQQQCQCCCQQCCCQCtCQQQQCgQQaQtCtCQQQQ 5040 

ttctttgtttttgtttttcgtgtcttgtggtcttgttgttctQgttttgtgggttgggtgQgttttgcttttQgagtttt 

HslI 

BstXI SfoNl 
II I 

gQgQttggCQQCQCQQCQQQCQCtgQQCQtc[Qt^CCQQCCtCQQtQCtgctaQttQttQCQQCCQtgattQtggCQtCtt 5120 

ctctQQCcgttgtgttgtttgtgQcttgtQgtQcggttggQgttQtgQcgattQQtQQtgttggtQctQQtQCcgtQgQQ 

5103 5114 
5104 

HslI Hpal HboII 

EcoRV Nspl Hindi! Fokl EcoRV 

1 II I III 

tctgccQQQtagatQtcQCQaQQCtQCQgcQtgtQggtgtQttggttQQCQgtcccQQQgggatgQQgQtQtcQCQQQQC 5200 
QgQcggtttQtctQtQgtgttttgQtgtcgtQCQtccQCQtQQccQQttgtcagggtttccctQcttctQtQgtgttttg 
. I . |] • I • -I I I ■ 

5132 5149 5165 5181 5188 

5150 5165 5185 

Tsp451 

EcoRV HboII HphI 

I I II 

titgQQQCQQgQtQtcQtQttttgQgcctcatQccQQQQQtQgQQgattctQQCtcttgtggtgQccQQCQgatcQQgcQ 5280 
QQQCtttgttctQtQgQttQQQQctcggQgtQtggtttttQtcttctQQgQttgagQQCQCCQCtggttgtctQgttcgt 

• | • • ejj • • 

5211 5243 5261 

5262 

Bell Fokl 
I I 
QtQCQQgQggttQttggQtQgactgQtCQttcctttQtQtgQtggattQQgQttQcagQQggQtgtgQtQgtgtccQQtc 5360 

tQtgttCtCCQQtQQCCtQtCtgQCtagtQQggQQQtQtQCtQCCtQQttCtQQtgtCttCCtQCQCtQtCQCQggltQg 

5304 5341 



Fig. Ij 
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QQgQQtccQQtgQQQQCQCtgQCCCcagQQCQQQQcgQttctttggQggggtQQttggQQCtQttgctctgggQgtggcQ 5440 

ttcttaggttQcttttgtgQctggggtcttgttttgctQQgQQQCCtccccQttQQccttgatQQcgagQCcctcQCcgt 
*»•••• 



Acil Dir^"^^^^ 

NspBii ?a!i 

II II 
QcctcQgcQCQQQttQCQgcggcQgttgctctggttgQQgccQQgcQggcQQgQtcQgQCQttgQQQQQCtcQQggaagc 5520 
tggagtcgtgtttQQtgtcgccgtcQQcgQgQCCQagttcggttcgtccgttctQgtctgtQQctttttgQgttccttcg 

I I' • I I • 

5457 5480 
5459 5485 

Sdul 

SqcI EcoRII 
Bsql HgiAI Bcql Avail 

1 I r II 

QQtcQgggQCQCQQacQQQgcQgtgcQgtcQgtccQgagctccQtGggQQQtttgQtQgtQgcQQttQQQtcggtccQgg 5500 
ttQgtccctgtgtttgtttcgtcQcgtcQgtcQggtctcgaggtQtcctttQQQctQtcQtcgttQQtttQgccaggtcc 
• I • I • • 'I • I I • 

5543 5557 5582 5593 

5557 55% 
5557 

Hindi! Hfel Bbvl Eco57I 

I I II 

Qt t Q 1 9 1 cQQCQQQgQQQ t cg t gccQ t cQQt t gcgogQ 1 1 ogg 1 1 g t gaagcQgcaggac 1 1 cog 1 1 oggaa 1 1 gcQt t q 5680 
tQQtQCQgttgttcctttQgcQcggtQgttQQcgctctQQtccQQCQCttcgtcgtcctgQQgtcQQtccttQQcgtQQt 
I * • I • • •! I 

5606 5628 5551 5660 

HphI 
I 

acacagcQttQctcagQQttQQCQQQCQtQttcggtgatQQcataggQtcQttacaagQaQaagggataaQQttacQQgg 5760 
igtgtcgtQQtgQgtcttQattgtttgtQtQQgccQCtQttgtQtcctQgtQQtgttctttttccctQttttQQtgttcc 

• O 0|ft ft • « • 

5714 

SfoNI Acil 
I I 

tQtQgCQtCQttatQCCgCQCQQQtQtCQCQgagQtQttCQCaQCQtCQQCQgttgQtQQQtQtgQtQtttQtgQtCtQt 5840 

QtQtcgtQgtQQtQtggcgtgtttQtQgtgtctctotQQgtgttgtQgttgtcQQCtQtttQtQctQtQQQtQCtagQtQ 

5765 5776 

HphI Hindu HphI PIqI 

I I I I 

tQtttQCQgQQtcQatQQQggtgagQgttQtQgQtgttgQcttgQotgQttQctCQQtCQCCCtccQQgtcQgQCtccct 5920 
QtQQatgtcltQgttQtttccQCtctCQQtQtctQCQQCtgQQCttQctQQtgagttQgtgggQggttcQgtctgQgggQ 

1 • I • • I • • I • 

5850 5876 5898 5913 



F/g. Ik 
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Fokl 
I 

ttQttQQCtQgQCtgCtgQQCQCCCQgQtttQCQgagtQgQttCCQtQtCCtQtQQCQtCCQQQQCQgQgQQtggtQtQt 6000 

QQtQGttgQtctgQcgQcttgtgggtctQQQtgtctcQtctQQggtQtQgtQtQttgtQggttttgtctcttQccQtQtQ 

5977 

HboII 
Ksp632I 

Earl BspHI Bsnl 

II I I 

ccctcttcccQgccQCQtcQtgQCQQQQggggcQtttctaggtggQgcQgQtgtcQQQgQQtgtQtQgQQgcQttcQgcQ 5080 
gggagQQgggtcggtgtQgtQctgttttccccgtQQQgQtccQcctcgtctacQgtttcttQCQtQtcttcgtQQgtcgt 
II * I • • • • • 'I • 

6003 6018 6071 

6003 
6004 

EcoRII PflHI 
I I 

gttQtQtQtgcccttctgQtccQggQtttgtQctQQQccQtgQQQtggQgagctgtttQtcQggQQQCQtQtcccQQtgt 6160 

CQQtQtQtQcgggQQgQctaggtcctQQQCQtgQtttggtQctttQcctctcgQCQQQtagtcctttgtQtQgggttQCQ 
* *! • I* • • • • 

6101 5118 

OsqI EBseRI 
I I 

ccQQgQQCcgtggttQQQtcQgQCQttgttccQQgQtQtgcQtttgtcQQtggaggagtggttgcQQQttgtQtQQCQQC 6240 
ggttcttggcQCCQQtttQgtctgtQQCQQggttctQtQcgtQQQCQgttQcctcctcQCCQQcgtttQQCQtQttgttg 

6158 6213 
Bspl407I 
MslI 

Nspl Nspl Bell 
Afllll Beg! EcoNI 
II II 1 II 

CQCQtgtQCQtgCQQCggtQtCggtQQtagQQtCQQtCQQCCQCCtgQtCaQggQgtQQaQQttQtQQCQCQtQQQgQQt 6320 

gtgjtQCQtgtQcgttgccQtQgccQttQtcttcgttagttggtggQctQgttcctcQtttttQQtQttgtgtQtttctto 
II I I • I • • • M • 

6242 6252 6284 
6242 6248 6286 
6243 
6245 

Bsnl 

I 

gtQQtQCQQtaggtQtcQQcggQQtgctgttcoQtQCQQQtQQQgaaggQQctcttgcQttttQCQCQCCQQQtgQtatQ 6400 
CQttQtgttQtccQtQgttgccttQcgacQQgttQtgtttQtttcttccttgQgQacgtQQQQtgtgtggtttQctQtQt 

6342 

Fig. n 
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Sdul "boll 
SqcI XbQl PI el 

Mfel HqiAI Bqlll Ksp632I 

I 1 11 I I 

QCQttQQQCQQttctgttgcQCtQgatccQQttgacQtQtcQQtcgQgctCQQtQQggccQaQtcQgQtctQgQQgQgtc 6480 
tgtQotttgttQQgQCQQcgtgaQCtQggtttQCtgtatQgttagctcgQgttQttccggtttQgtctQgQtcttctcQg 

r • I • • I I' I I * 

6429 6446 6446 6473 

6446 6449 6476 

5446 5473 
SFqNI 
I 

QQQQgQQtggQtQQgQQggtCQQQtCQQQQQCtQgQttCCQttggQQQttggcatCQQCttagCQCCQCQQtCQtQQttg 6560 

ttttcttQcctQttcttccQgtttQgtttttgQtctQQggtQQcctttQQCcgtQgttQgatcgtggtgttQgtQttQQc 

6532 

Vspl 

Asel EcoRI 



ttttgatQQtgQtaattatQttgtttQtaattaatgtaacgatQattQtQQttgcagttaagtQttQCQgaattcaaaag 6640 

QQQQCtQttQCtQttQQtQtQQCQQQtQttQQttQCQttgCtQttQQtQttQQCgtCaattCQtQQtgtCttQQgttttC 

I • • • I • 

6590 6630 
6590 



Ndel 
I 



Bglll 



6669 



QgQQQtCgagtggQtCQQQQtgQtQQQCCQtatgtQttQQCQQQCQQQtgQCQgQtCtQtQgQtCQttQgQtQttQQQQt 6720 

ictttagctcQCCtQgttttQctQtttggtQtQCQtQQttgtttgtttQctgtctQgatQtctQgtQQtctQtQQttlttQ 

« • I • m • 

6693 xcRl 
F end HN start | 



tQtQQQQQOCttaggQgtaQQgttQCgCQQttCQQCtCtQCtCQtQtQQttgQgQQQgQQCCCQtlCQgQCQQQtCCQQQt 6800 

gtattttttj qQo ltcctcatttci iatqcqttQaqttgaqatqagtatattQacQctttcttgggttqtctgtttagqtttQ 

6795 

Styl 
Ncol 
OsqI 

BsrI SfoNI Fokl 

I I M 

ccgQcEt|gQQtQCQggQQgcQCQccQQtcQcgggQQQgQtgctggtQQtgQgctggQQQCQtccQtggctQctcQtggc 6880 
ggctcFoccttQtgQccttcgtgtggttQgtgccctttctQcgQCCQttQctcgacc^ 

6813 6839 6861 



Fig. IfR 
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6864 
6864 

m 

Hphl EcoRII HboII 

I III 

QQCQQgQtCQCCQQCQQgQtQQCQtQtQtQttQtggQCQQtQQtCCtggtgttQttQtCQQtagtCttCQtCQtQgtgCt 6960 

ttgttctQgtggttgttctQttgtQtQtQtQQtQcctgttQttQggQccQCQQtQQtQgttQtcQgaagtQgtQtcQcgQ 
I . . . . I . . I |. 

„ , 6888 6925 6945 

n nr 6^49 

Asei BsrOI 

I I 

QQttQQttccQtcQQQagtgQQQQQgcccQtgQactQttgctQCQQgacgtQQQCQQtgQgtttQtggQQgttQCQgQQQ 7040 

ttQQtiQQggtQgttttcQctttttcgggtQcttQqtQQcgQtgttctgcQtttgttQctcQQQtQCCttcQQtgtcttt 
I" • • » • • 

6962 5996 
6962 

Vspl 
Asel 

SfoNI Sspl PIqI 
III I 
QgQtccQQQtggcQtcggQlQQtQttQQtgQtctQQtQcagtcQggQgtgQQtQCQQggcttcttQCQQttcQgQgtcQt 7120 
tctQggtttQccgtagcctQttQtQQttQctQgQttQtgtcQgtcctcQcttQtgttccgQQgQQtgttQQgtctcQgtQ 

7052 7051 7114 
7064 
7064 

EcoRV 
I 

gtccQgQQt tatotaccgatQtcQt tgacQCQacQQQtgtcggQtct taggaaat tCQt togigoaat tacaat toggoa 7200 
CQggtcttQQtQtQtggctQtQgtQQctgtgttgtttacQgcctQgQQtcctttQQgtQQtCQCtttQQtgttQQtcctt 

Cqc81 
Acil 

SfQNl Oral HXwl SfoNI 

II II I 

tgatQQtcgQgQQgtgcctccQCQQQgQQtQQcacQtgQtgcgggcQtQQQQcctttQQQtccQgQtgQtttttggQgat 7280 
QCtQttQgctcttcacggaggtgtttcttQttgtgtQctQcgcccgtQttttggaaatttQqgtctQctQQQQQCctctQ 
• • • |°j *l°" * I* 

7238 7255 7262 7278 

7241 
7241 

HboII 
I 

gcQcgtctggtcttccQtctttQQtgQQQQctccQQQQQtQQgtttQQtgccggggccgggQttQttagctQtgccQQcg 7360 
cgtgcQgaccQgQQggtagQQQt tQct 1 1 tgaggt 1 1 1 tat tccGQt tQcggccccggccctQQtQQtcgatQcggt tgc 

7291 



Fig. In 
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Xhol 
Aval 

I 

QCtgttgQtggctgtgttagQQCtccgtccttQgttQtQQQtgQtctgQtttQtgcttQtQCCtcQQQtctQQttQctQg 7440 
tgQCQQCtQccgQCQCQQtcttgaggcQggQQtcQQtQtttQctQgQCtQQQtQcgQQtQtggQgtttQgQttQQtgQgc 

7437 
7437 

Kpnl 

EcoRII Ban I 

I I 
QggttgccQggQtQtQggQQQatcQtQtcQQgtQttQCQgQtQgggQtoatQQCtgtQQQCtcQgQCttggtQcctgQct 7520 

iccQQcggtcctQtQtccttttQgtQtQgttcQtQQtgtctQtccctQttQttgacQtttgQgtctgQQccQtggQctgQ 

I 4 • » • • • j • 

7447 7510 

7510 

Styl 

Avrll BsqBI 
I I 
tQQQtcctQggatctctcatcct t tcQQCQtQQatgQCQQtagQQagtcatgt tctctagcactcctcQQCQCQgQtgtQ 7600 
QtttQggQtcctQgQgQgtQtgQQQgttgtQtttQctgttQtctttcQgtQCQagagQtcgtgaggQtttgtgtctQCQt 
1 « • • • • • 'I * 

7526 7595 

^^^^ Plel SfQNI HboII EcoRV 

I III 
tQtcQactgtgttcgQCtcccQQQgttgQtgQQagatcQgQttQtgcatcQtcaggcatQgaQgQtQttgtQcttgatQt 7580 

QtogttgQCQCQQgctgagggtttcQQCtQctttctQgtctQQtQcgtQgtQgtccgttttcttctQtQQCQtgQQCtQtQ 

7615 7646 7661 7676 

Acil 

BspHI Bell Ndel ^ 

cgtcaQlcatgatggttcQQtctcQQCQQCQQgQtttQQgQQCQQtQQtQtQQgttttgQtcQQCCQtQtgcggcQttQt 7760 
gcQgttQgtQctQccQQgttQgQgttgttgttctQQQttcttgttQttQtQttCQQQQCtQgttgglQtacgccgtQtttQ 
I . . . . . I • I • I 

7687 7738 7746 

EcoRlI 7751 
PflHI Avail Sspl Aval Fokl 

' ' I ' ' ' 

QCCCQtctgttggaccQgggatQtQctQCQQQggcQaQQtaQtQtttctcgggtQtggQggtcttgQQCQtccQatQdat 7840 

tgggtagQCQQCCtggtccctQtQtgQtgtttccgttttQttQtQQQgQgcccQtQcctccQgaQCttgtQggttQtttQ 

I'll* • •! 1 • * r 

7763 7772 7801 7808 7829 

7775 

Fig. lo 
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SbqI 

Bsnl BsrI Aval BsnAI SfaNI Styl 

I II I I r 

gQgQQtgcQQtctgcQacocQQCtgggtgtcccgggaQQQcgcQgagQQgctgcQQtcQggcatctcatagtccttggtt 7920 
ctcttQcgttagocgttgtgttgQcccQCQgggcccttttgcgtctctctgQcgttQgtccgtQgQgtQtcQggQQCCQa 
I • 'I *l • I • "I • I • 

7843 7862 7871 7887 7901 7913 

7871 

Hindi! 

Fokl Hindi! Eco57! EcoRV 

III II 
ttcQgQCQgQQggatggtCQQCtccQttQttgttgttgQCQQgggctcQQQCtcQQttccQQQgctgQQggtQtggacgQ 8000 
QQgtctgtcttcctQCCQgttgQggtQQtQQcaacQQCtgttcccgQQtttgagttQQggtttcgQCttccQtQcctgct 

• 1 I • • I • • • I ' I' 

7932 7955 7985 7999 

7937 

BsnA! Bsr! Ball I Ball I 

II 11 
tQtccQtgQgacQQQQttQctgggggtcQgQQggQQggctQcttctQctQggtQQcaagQtctQtQtQtQtQCQQgQtct 8080 
QtQggtactctgttttQQtgQcccccQgtcttccttccgQtgQQgQtgQtccQttgttctQgatQtQtQtQtgttctQga 
I • I* • • • I • • I • 

8008 8019 8058 8075 

QCQQgttggcatagcaQgttQCQattQggQatQQttgQtattactgQttQCQgtgQtQtaagaatQQQatggacQtggca 8160 

tgttCQaCCgtQtCgttCQQtgttQQtCCttQttQQCtQtQQtgQCtQQtgtCQCtQtQttCttQttttQCCtgtQCCgt 

Sty! 
Nco! 

EcoRII OsqI Fokl 

I I I 

tQQtgtgctatcQQgQCCQggaQQCQatgQQtgtccQtggggQCQttcQtgcccQgatggQtgtQtQQCQQgQgtQtata 8240 
QttQCQcgatagttctggtcctttgttQcttQCQggtQcccctgtQQgtQcgggtctQcctQtQtQttgtcctcatQtQt 

«|tt «|0 0 |0 « ft 

8177 8195 8219 

8195 
8195 

SfoN! 

BsqB! PIqI BsrI 

I 1 I 

ctgQtgcQtQtccQctcQQtcccQCQgggQgcQttgtgtcQtctgtcQtQttQgQctcgcQQQQQtcgQgagtQQQCccQ 8320 
gQCtQcgtQtaggtgQgttQgggtgtccctcgtQQCQCQgtQgQCQgtQtQQtctgQgcgtttttQgctctcQtttgggt 

8243 8294 8318 

8243 



Fig. Ip 
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Fokl 



Ball 
Cqc8I 
I I I 



PvuII 
NspBII 



g t CQ t QQC 1 1 Qc I CQQCQ t CQQC t gQQQgggt QQQcgagc t ggccQt ccgQQQCQQQQCQC t c t cage t gga t Q t qcqqc 8400 
CQgtQttgQQtgagttgtQgttgQctttcccQtttgclcgQCcggtQggctttgttttgtgagQgtcgQCCtQtQtgttg 

ri I • • I • 

8359 8385 

8361 8385 
8365 

Bbvl Hindlll 
I I 

QQCgQgCtgCQttQCQCQCtQtQQCQQQggQtQttgttttCatQtQgtQgQQQtQQQtCQtQQQQgCttagQCQCQttCC 84B0 

ttgctcgQcgtQQtgtgtgQtQttgtttcctQtQQCQQQQgtQtQtcQtctttQtttagtQttttcgQQtctgtgtQQgg 
I* • • • • • 

8406 8464 

PstI 
Bbvl 
II 

QQCCtCtgttgttCQQQQCQgQgattCCQQQQQgCtgCQgttQQtCQtQQttQQCCQtQQtQtgtQttOQCCtQtCtatQ 8560 

ttggQtQCQQCQQgttttgtctctQQggtttttcgQcgtcQQttQgtQttQQttggtattQtQCQtQQttggQtogQtQt 



8514 
8515 



HN end 



SphI 
Nspl 
Cqc8I 
L start I L 



atQCQQgtQtQtgQtQQgtQQtcQgcQatcQgQCQatQgatQQQQgQdQQQtQtQQQQQcjcttfiggQgcaQactQtgctc 8640 
tQtgttcatatactQttcQttQgtcgttQgtctgttQtctQttttctctttatQttttttgaQtcctcgttt^tQcgag 



HphI 
Tsp45I 



8633 
8633 
8633 



gQQQoEt|gQCQCtgQQtctQQCQQtggcactgtQtctgacQtQctctQtcctgQgtgtcQCCttaQttctcctatcgtt 8720 
ctttttQ^ctgtgQcttQgQttgttQccgtgQCQtQgactgtQtgQgQtaggQCtcQCQgtggQQttQQgaggatQgcQQ 



HslI 



AccI 
Plel 
I I 



8698 
8699 

Fokl Plel Spel 



aQgggtQQQQtagcQCQQttQCQCQCtQttQtgagtctQccQCQgccttQcgatQtggQtgQcgQctcQQtQCtagttQt 8800 
ttcccQttttQtcgtgttQatgt^tgQtQQtQctCQgQtggtgtcggQQtgctQtQcctQCtgctgQgttQtgatCQatQ 

8744 8753 8777 8784 8792 

8755 

Fig. l(jf 
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Sspl 
Vspl 

HboII Asel 
I I I 

CQCtQQQCQgQaQQtQaQQCtCQQtQQQttQgQtQQaagQCQQCgQtCtQttQgQQgQttQQQQttQQtQttQQCtgQgQ 8880 

gtgQtctgtcttttQttttgQgttQtttQQtctQttttctgttgctQgQtQQtcttctQQttttQQttQtQQttgactct 

8854 8864 
8864 
8867 

EcoRII 

EcoRV SexAI 

I II 
QQgtgQQtgQCttQggQQQQtQCQCQtttQttQgQtQtccQgQQQtgtcaaQQgQQQtgttcQQQttQCQtatQcctggt 8960 

ttcacttQctgQQctcttttQtgtgtQQatQQtctQtaggtctttQCQgttttctttQCQogtttQQtgtQtQtggQccQ 

8914 8954 
8955 

Tsp45I Bglll 

I I 
QttQQCQgtaQagtgQctgQQttQttQcttQaQgcQgQtQgaQCQtQtQgtcQQatgQctgatggQttQQgagQtctQtg 9040 

tQQttgtCQtttCQCtgQCttQQtQQtgQQtttCgtCtQtCttgtQtQtCQgtttQCtgQCtQCCtQQttCtCtagQtQC 

• I • • • * * 'I * 

8973 9032 

Vspl 

Vspl Asel 

Asel MboII 

I I I 

gQttQQtgtgctQtcgQQQttagcctcaQQQQatgQtggQQgcQQttQtgQtcttQQtgQQgQQQttQQtQatQtQtcQQ 9120 

CtaQttQCQCgQtQgCtttQQtCggQgttttttQCtQCCttCgttttQtQCtOgQQttQCttCtttQQttQttQtQtQgtt 

9042 9099 
9042 9105 

9105 

Xcnl HboII 
I I- 

QQgttCQCQCQQCCtQtQQQtCQgQtQQQtggtQtQQtCCQttCQGlQQCQtggttCQCtQtCQQgtQtgQtQtgQgQQgQ 9200 

ttcQQgtgtgttggQtQtttQgtctQtttQCCQtQttaqgtQQgttttgtQccQQgtgatQgttcQtQCtQtQCtctcct 

« • * u • « * I * 

9159 9196 
HboII Xnnl 

Bcql Tsp45I Ebsl Mboll 

I I II 

ttgcQQQQQgctcgQQQtgQggtcQCttttQttQtggggQQQgQttataacttgttQgQQgQCCQgttQgQatttcttQtt 9280 

QQCgtttttCgagCtttQCtCCQgtgQQQQttQtQCCCCtttCtQQtQttgQQCQQtCttCtggtCttCttQQQgQatQQ 

I . . I . . . 1*1' 

9203 9222 9258 9266 

9258 9266 



Fig. Ir 
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Pokl Sspl 
I I 

gQtQCQtCCagQQttggttttQQtQttQgQtQQQCQQQQCtQtQQtggttQtCtQQttQCtCCtgQQttQgtQttgCCgt 9360 

ctQtqtQqqtct tQQCcaQQQt totaatc tat 1 1 gt 1 1 1 gatat taccaatagat taat gaggact tQatcQtaacggcQ 
I . • I • . • • • • 

9285 9302 
|sp45I Nspl 

QttgtgQcgtagttgQQggccgQtggQQtQtQagtgcQtgtgctQQgttQgQtccQQQQttQCQQtctatgtQtcQgQQQ 9440 

tQQCQCtgCQtCQQCttCCggCtQCCttQtQttCQCgtQCQCgQttCQQtCtQggttttQQtgttQgQtQCQtagtCttt 
I • < • I • • • • • 

9364 9395 , _., 

EcoRV 

I 

ggcQQtQQtctgtgggQQgtgQtQgQtQQQttgtttccQQttQtgggagQQQQgQCQtttgQtgtgQtQtcQttQttQgQ 9520 

CCgttQttQgaCQCCCttCQCtQtCtQtttQQCQQQggttQQtQCCCtCttttCtgtQQQCtQCQCtQtQgtQQtQatCt 
• • • • • * I * * 

9506 

BspHI Oral 

QCCQCttgcQttQtctctQQttcQQQCtcQtgatcctgttQQQCQQttQQggggQgcttttttQQQtcQtgtgttQtccg 9600 
t gg t gQQcg t QQ t agagat t aag 1 1 1 gag t qc t aggacQQ 1 1 1 g 1 1 qq 1 1 cc cc t cgQQQQQQ 1 1 1 ag t qcqcqq t aggc 

9548 9581 
XbQl CIqI 

I 1 

QgQtggaottgatQtttgQQtctQgagaQtcgQttQQQgQQtttctgQgtgtQgQttQCQttgatQQQQtcttQgQtQtQ 9680 

tCtQCCttQQCtQtQQQCttQQQtCtCttQgCtQQtttCttQQQgQCtCQCQtCtQQtgtQQCtQttttQgQQtCtQtQt 
. .| |. . . • • • 

9621 9629 

Fokl 
SfaNI 
II 

1 1 tQQtQQQtctQCQQtagQtgQQQtQgcQgQgQt 1 1 tctct 1 1 1 1 1 tagQQCQt 1 1 gggcQtcct ccQt tagoggctag 9760 
QQQttQmQgatgttQtctactttQtcgtctctQQQQgQgQQQQQQQtcttgtaQQCCC|tQggQggtQQtctccgQtc 

9740 
9741 

Vspl 

Bbvl Asel 

I I 
tattgcQgcagQQQQagttQgQQQQtQtQtgtQtQttgggQQQCQQttQQQQtttgacactQttQQtQQQtgtcQtgctQ 9840 

QtQQCgtCgtCtttttCaQtettttQtQtQCQtQtQQCCCtttgttQQttttQQOCtgtgQtQQttatttQCQgtQCgQt 
I* « * * • *l« • 

9765 9822 
Bspl407I 9822 



Fig. Is 
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Bspl407I 

HboII Tsp45I Bell 

I I 1 I 

tcttctgtQCQQtQQtQQttQQcggQtQtQgagQQQggcQtggtggacQgtggcctcctgtgQCQttQCCtgQtcQtgcQ 9920 
QgQQgQCQtgttQttQttQQttgcctQtQtctctttccgtQccQcctgtcQccggoggacQCtgtQQtggQctQgtQcgt 
II' • • • • I •! • 

9841 9900 9911 

9846 

NdQl 

EcoRl EcoRV 
I I I 

CQcgQattcQtcQtQQQtgcttQcggttcQQQttctgcgatatcQtQtgQQQQcgctgttgQttQttQccQgagctttQt 10000 
gtgcttQQgtQgtQtttQcgQQtgccQQgtttQQgQcgctQtQgtQtQcttttgcgQCQQCtQQtQQtggtctcgQQQtQ 

9924 9959 

9964 

Hboll BsbAI 
I I 
QggQQtQQQQtttQQtQQQttcatQgQQCctcQgttQgQtgQQgQtttgQCQQtttQtatgQQQgatQQQgcattgtctc 10080 

tCCttQttttQQQttQtttQQgtQtCttggagtCQQtCtQCttCtQQQCtgttQQQtQtQCtttCtQtttcgtQQCQgQg 

• • • cj • 0 tjt 

10041 10076 
Fokl 

Bsrl SfoNI SfoNl HboII 

II II I 

CQQQaQaQtcQQQCtgggQCQCQgtttctcctgcatctQQtttQctgtQccgtQCtQQcgcQtccQQcgQQtcQcgQQgQ 10160 
gtttttttQgtttgQccctgtgtcQQagaggQcgtQgQttaQQtgQCQtggcatgQttgcgtaggttgcttQgtgcttct 
. I . ' I • • 1| • I • 

10093 10113 10140 10156 

10141 

BsqBI Bsrl 
I I 
ttQgttgQQQQQtttQtQgcQgatQgtQQQtttgQtcctQQtcQgQtQttQgattQtgtQgQQtctggggQCtggttQgQ 10240 

QQtCQQCt 1 1 1 tQQQtQtCgtCtQtCQt t tQQQCtaggQt tQgtCtQtQQtCtQQtQCQtCt tQgQCCCCtgaCCQQtC t 

10194 10231 

Plel 

Sspl AccI 

I I I 

tgQtCCQgQQtttQQtQtttCttQtagtCttQQQgQQQQQgQgQtCQQQCQQgQQggtQgQCtCtttgCQQQQQtgQCQt 10320 

QCtQggtcttQQQttQtQQQgQQtQtCQgQQtttctttttctctQgtttgttcttccQtctgagQQQcgtttttQCtgtQ 

t I • • • * I I * * 

10254 10297 

10300 

BsniAI Fokl 

I I 

QtQQQatgQgQgctQCQCQQgttttQtcagQgacQCtQcttgcQQQtaQtQtQgggoQQttctttcQQgQQQQtgggQtg 10400 

tQttttQCtCtCgQtgtgttCQQQQtQgtCtCtgtgQtgQQCgtttQttQtQtCCCtttQQgQQQgttCttttQCCCtQC 



10350 r-. 1* '°396 

Fig. U 
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Bstll07I 

Aflll OsqI AccI 

I I I 

gtQQQQggQgQgQttgQQttQcttQQgQgattQQCQQCCQtQlcQQtQtCQggagttccQcggtQtQQtgQQgtQtQCQQ 10480 

CQttttcctctctQQCttQQtgQQttctctQQttgttggtQtagttQtQgtcctcQaggtgccQtQttQcttcQtQtgtt 
« • J ♦ • • j • • I * 

10422 10458 10473 

10473 

HslII HboII 

I I 

tQQttCtQQQQgtCQtQCQgQtgQtCttQQQQCCtQCQQtQQQQtQQgtQQtCtCQQtttgtCttCtOQtCQgQQQtCQQ 10560 

QttQQgQttttcQgtQtgtctQctQgQQttttggQtgttQttttQttcQttQgagttQQQCQgQQgQttQgtctttQgtt 
• I • « • • • I • • 

10494 10542 
EcoRI Hindll BsmAI Bglll 

II I 1 

QgaQQtttgQQttcQQgtcQQcggQtQtttQCQQtgatggQtQcgagQctgtgQgctgttttctQQCQQCQgQtctcQQQ 10640 
t c 1 1 1 QQQc 1 1 QQg 1 1 cQg 1 1 gcc t Q t QQQt g 1 1 Qc t Qcc t Q t gc t c t gacQC t cgQCOQQQgQt t g 1 1 g t c t Qgogt 1 1 
|. I . . • I • • •] 

10569 10577 10605 10631 

Sspl 

BsqBI BstXl 

I I I 

QQatQctgtcttQQttggQgatQtgQQtcQQCQgctctQtttggQgQQQCttgcQQCCQQQtQtttggattcQQtQQQtt 10720 

tttQtgacQgQQttQQCCtctQtQcttQgttgtcgagQtQQQcctctttgQocgttggtttQtQaQcctQQtttQtttQQ 
) I t • • I ij < • 

10650 10697 

10700 

HphI 
I 

gtttQQttggttQCQCCCtcgtcttgQQggQQgtQcaQtctQtgtQggtgatccctQttgtcctccQtcQgatQQggQQC 10800 
CQQQttQQccQQtgtgggagcQgQQCttccttcQtgttQgQtQCQtccQctQgggQtQacQggQggtogtctQttccttg 



I 

QtQtQtcQttQgaggQtcQccctgQttctggQttttQtgttcQtQQCccQQgQgggggtQtQgQQggQttttQtcQQQQQ 10880 

tQtatQgtQQtCtCCtQgtgggQCtQQgQCCtQQQQtQCQQgtQttgggttCtCCCCCQtQtCttCCtQQQQCQgttttt 

«!« « tt e « • • 

10817 



PvuII 
NspBII 

Bbvl BsrOI 



II I 
ttgtggacQctcQtQtctQtQQgtgcQQtQCQtctogcQgctgttagQQtQggcgtQQgggtQQCtgcQQtggttcQQgg 10950 
QQCQCctgtgagtQtQgQtQttcQcgttQtgtQgQtcgtcgQCQQtcttQtccgcQttcccQttgQcgttQCCQQgttcc 

< • • I I 0 • * I • • 



Fig. la 



10917 10947 
10918 
10918 
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QgQtQQtcQQgctQtQgctgtQQCQQCQQgQgtQcccQQCQQttQtgactQCQgQgttQQgQQggagatQgtttQtQQQg 11040 
tctQttQgttcgatQtcgQCQttgttgttctcQtgggttgttQQtQctgQtgtctcQQttcttcctctQtcQQQtQtttc 

HphI Fokl BspHl 

I II 
QtgtggtgQgQttttttgQttcQttQQgQgQQgtQatggQtgQtctQggtcQtgQacttaQQttQQQtgQQQCOQttQtQ 11120 

tQCQCCQCtCtQQQQQQCtQQgtQQttCtCttCQttQCCtQCtQgQtCCQgtQCttgQQtttQQtttQCtttgttQQtQt 
I • ■ • I • I • • • 

11045 11078 11090 

HboII 

EcoRI XbQl 
I I I 
QgtQgcQQgQtgttcQtQtQtQgcQQQQgQQtQtQttQcgatgggQgQQttcttccccQQgctctgQQQgcQttQtctQg 1 1200 
tcQtcgttctQCQQgtQtQtQtcgttttcttQtQtQQtgctQccctcttQQgQQggggttcgagQctttcgtQQtQgQtc 
• • ■ *l*l * *l* 

11167 11196 
11171 

SfQNI 

HboII BsniAI BsqBI HboII Mfel 

II III I 

atgtgtcttctggtcagagQCQgtaQtQgQcgQQacQQgQtcQgcQtcttcQQQCttggcQQCQtcQtttgcQQQQgcQQ 11280 
tQCQCQgQQgaccQgtctctgtcQttQtctgctttgttctagtcgtQgQQgtttgQQccgttgtagtQQQcgttttcgtt 
I . I . . I- I I • • • !• 

11206 11217 11239 11247 11278 

11244 

Sphl 

EcoNl Nspl f^okl 

HphI Cqc8I Styl BsqBI 

II I I II 

ttgQgQQtggttQttcacctgttctQggatQtgcQtgctcQQtttttaQgQacQttcQQCQQCtQtQtattgcccttggg 11360 
QQCtcttQccQQtttQgtggQCQagQtcctQtQcgtQcgQgttQQQQQttcttgtQagttgttgQtQtQtaacgggaaccc 

I I • • I • * • 'III 

11295 11313 11354 11360 

11298 11313 11359 

'••^•^ SfoNI 

Fokl SfQNI 
II I 

QtgQQtQtCQQtCCQQCtQtQQCQCQgQQtQQCQQQgQtttQtQttttaggQQtCCQQQttggQtgCQQtQtgCQtCttt 11440 

tQcttQtQgttQggttgQtQttgtgtcttQtQgtttctQQQtQtQQQQtccttaggtttQQCctQcgttQtQcgtagQQQ 

11422 11433 
11423 

Ball 

BspHI HslI Sspl 

1 II . I 

QQtQcctgctQgtgttgggggQttcQOttQCQtggccQtgtcQagQtgttttgtQQggaQtQttggcgQtccQtcQgttg 11520 
ttQtggQcgQtcQCQQccccctQQgttQQtgtQccggtQCQgttctQCQQQQCQttccttataQccgctQggtagtcQQC 

I . . -I I • • I- 

11444 11471 11499 

Fig. ly 11472 
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Acil SexAI 

I 1 
ccgcQitQgctgatQttQQQQgatttQttQQggcgQQCctQttQgQccgQQgtgttctttQtQggQttQtgQQtcQQgQQ 11500 
ggcgtaQtcgQCtQtQQttttctQQQtQQttccgcttggatQQtctggcttcQCQQgaQQtQtcctQQtactiQgttctt 
J ••••••• j 

11521 11600 
Plel 
Hphl 

EcoRII Bsrl Eco57I MslI 

III II I 

ccQggtgagtcQtcttttttggQctgggcttcagQCCCQtQttcQtgcQQtttaccQCQQtctcaGQatQtQQccQCtQt 11580 

QgtccQCtcQgtQgQQQQQQCctgQcccgQQgtctgggtQgQQgtQcgttQQQtggtgttQgQgttttQtQttggtgQta 

III- • I 1- 1 • 

11601 11623 11629 11638 

11604 

11607 Hphl 

I 

gQtQQQQQQtQtQQCQgCQQgaQQtgtQttQCQQgattCQCCgQQtCCQttQttQtCtggQttQttCQCQQQtQCQQtgQ 11760 

ctQttttttQtQttgtcgttctttQCQtQQtgttctQQgtggcttaggtQQtQQtQgQcctQQtQQgtgtttQtgttQCt 

11718 

HboII 

HboII HboII XbQl 

II I I 

tQgQagQQgQtgQQgQQttQgctgQgttcttgatggQCQggQQggtQQttctccctQgagttgcQCQtgQiQttctagQt 11840 

QtCttCttCtQCttCttQQtCgQCtCQQgQQCtQCCtgtCCttCCQttQQgQQggQtCtCQQCgtgtaCtQtQQgatCtQ 

I I . I . . . . . • I • 

11763 11772 11834 
11766 

BseRI 

.! 

QQttctctcQCQggaQtcQgQQQtgctQtQgctggQtttgttQgQtacgQCQQQQtctctQQttcgggttggcQtQaatQg 11920 
ttaQgagagtgtccttagtctttQcgatQtcgaccttQcaatctQtgctgttttagagattaagcccaQCcgtQtttQtc 

«•«•••*<! 

11920 

QggQggQctgQCQtQcagtttgttgaggQQQQtcQgtQQttQcgQtctQgtQCQQtQtgQQQCQCtQQgtQggQCtttgc 12000 
tcctcctgQctgtQtgtcQQQCQQCtccttttQgtcQttQQtgctQgQtcQtgttQtQCtttgtgQttcQtcctgaaQcg 

Hgoi 

HboII AccI BsrOI 

I I II 

gQCtaQttgtQQgcgQCQQQQtcaggtatgaagattjtgtgttcggtQgQccttgctQtagcQttgcgtcaQuagutgtgg 12080 
ctgat tQQCQt t cgctQt 1 1 tagtccQtoct tctatQCQCQogccQtct ggaQcgatatcqtaQcgcagt 1 1 tctacQcc 

I • I • -I I • 

12030 12045 12061 

12065 



Fig. Ill/ 
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Fokl HphI 

I - 1 

QttcQtttQtcaggQggQQggQtgQtQQgtggQcttgQQQCQCCtgatccQttQgoQttQctQtctggggtgQtQQtQQc 12160 

tQQgtQQQtQgtcctccttcctQctQttcQCCtgQQCttigtggactQggtQQtcttQQtgQtQgQccccQCtQttQttg 
• I 

12100 12149 
Eco57I 

Mboll BstXI Fokl Sspl 

II II I 

QggQtcggQQCQttgtQQQQtQtgttQttcttcQgQtggcQCQQQCccQtQtQcttggatgtQtttQccgggtQQtQttQ 12240 
tcctQgccttgtQQCQttttQtQcaQtQagQQgtctQccgtgtttgggtatQtgQQCCtQcatQQQtggcccQttQtQQt 

ll • I • I • • I ' 

12189 12207 12217 12234 

12190 

Tsp45I 
AMI Bglll 
I I 1 

QQQtQggotcQgcQgQQQCQggtQtQtcQtcQttgQgQgttccttQttttggQtcogtcQCtgQtgQgQgQtctgQggcQ 12320 
tttQtcctQgtcgtctttgtccQtQtQgtagtQQctctcQaggQQtQQQQCctQgtcQgtgQCtQCtctctQgQctccgt 

12295 12309 
12297 

Hfel BspHI Acil BsrOI 

1 III 

CQQttgggQtQtQtcQQgQQtcttQgtQQQcctgcQQQQgccgcQatQQgQQtagcQQtgQtQtQtQCQtgggcQtttgg 12400 
gttQQccctQtQtQgttcttagQQtcQtttggQcgttttcggcgttQttcttQtcgttQctQtQtQtgtQcccgtQQacc 
1 • • \ 'I • I • 

12321 12350 12361 12375 

Pnll 
BsqAI 

EcoRV Fokl M Afllll BsnAI 

III II i 

iQQtgQtgagQtQtcttggQtggQggcctcQcaQQtQgcQcaQQCQcgtgcQQQttttQCQCtQgQtQgtctcQQQattc 12480 
QttQctactctQtQgQQcctQcctccggQgtgtttQtcgtgtttgtgcQcgtttQQQatgtgatctQtcQgagttttQQg 

I 1- I • • ll • • r 

12410 12418 12424 12444 12469 

12445 

12445 „ , 

BsrI 

Aqel Gsul 
I 1 I 

tQQCQccggtQgctQCQtcaQCQQQtttQtcQCQcagQttQQQggatQgtgcQQCccQgatgQQgttctccQgtQCQtcQ 12560 
QttgtggccQtcgQtgtQgttgtttQQQtQgtgtgtctQQtttcctQtgocgttgggtctQcttcQQgQggtCQtgtQgt 
1 ...... I . 

12485 12548 

12550 



Fig. Ix 
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Eso3lI 

Nspl BsraAI 
HslI Afllll BsqI BsqBI 

I I II 

ttgQttQgggtcagcQgattcQtQQCQQtgtccaQtgQtQQCQtgtctQtcQQggQQgctQQtgagQccQQQgQtQccaQ 12640 
QQCtQQtcccQgtcgtctQQgtQttgttQCQggttQctQttgtQCQgQtQgttccttcgattQctctggtttctQtggtt 
•I • -I • • I • I • 

12581 12601 12624 12633 

12601 12624 

12624 

Hpal Sspl 
Hindi! MI MboII 

I II I 

tcttQtttQtcaQCQQQtQQtgttQQCQggQttaQgtgttttcgQatQtttQtttQgQttQgQQgaQQCCQCQggQcacQ 12720 
QgQQtQOQttigttgtttattQCQQttgtcctQQttcQCQQQagcttQtQQQtQQQtctQQtcttctttggtgtcctgtgt 
. I . -I I • -I 

12662 12681 12702 

12662 12685 
Bsgl Vspl AccI 

HslI SfQNl MboII Asel Plel 

I I I I III 

QCcccQtQgttQtgcQtctgcQCQtQgQQgQtgQgtgttgcQttQaQgQQQgttttQQtgQtgQgcQtQttQatccQgag 12800 
tggggtQtcQQtQcgtagacgtgtQtct tctQctcQCQQCQtoat t tct t tcQQQQt toe tQctcgtatoot toggtctc 
I • I I • 1 • • • • ]• II 

12725 12734 12747 12789 12798 

12738 12789 12800 

Vspl Kpnl BsrBl 
Asel Banl Acil 
I I I 
tctdCQttagQQttQQttQggtQccctgQQQgtQQtgaatttQtttatgQtQQQgQcccgctcQQggQcgtggQcttQtc 12880 
QgQtgtQQtcttQQttaatccQtgggQctttcQttQcttQQatQQatQctQtttctaggcgagttcctgcacctgQQtQg 

• I I • • • • 

12812 12820 12858 
12812 12820 12858 

Hfel Fokl Nspl 

1 I I 

QQQQGttQtggltQttQQQgQtCQttCttQCQCQQttgQtQtgQQttQttgggQtgQtQCtgQCQtCatQCQtgCQQttt 12960 
ttttgQQtQCCQQtQQtttCtQgtQQgQQtgtgttQQCtQtQCttQQtQQCCCtQCtQtgQCtgtagtQtgtQCgttQQQ 

• « • I • 0 I « I * 

12913 12932 12950 

M Tsp45I BsrOI 

11 I 

CQQtQtgtQCtgCQQttQCQQtQgCQgQCQCtQtgtCQCQQttQgQtCgagQtQQCttQQQQgQgatQQtQgtCQttgCQ 13040 

gttQtQCQtgQcgttQQtgttQtcgtctgtgQtQCQgtgttQQtctagctctQttgQQttttctctQttQtcQgtQQcgt 

• • 1*1* * * *l* 

12987 12995 13034 



Fig. ]y 
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Vspl 

Asel Plel Aflll 

I I I 

QQtgQtgQtgQtQtiQQtQgcttQQtcQCtgQQtttttgQctcttgatQtQcttgtQtttcttQQgacQtttggtggQtt 13120 

ttQCtQCtQCtQtQQttQtCgQQttQgtgaCttQQQQQCtgagQQCtQtQtgQQCQtQQQgQQttCtgtQQQCCQCCtQQ 

• I • • I* • • I . • 

13053 13079 13101 

'3053 Avail 
Oral PpuMI 

I II 

QttQgtQQQtcQQtttgcQtQCQCtctttQtQgtttQQQQQCcgQQggtQgggQcctcQtttgggQttQtQtQQtgagQQ 13200 

tQQtCQtttQgttQQQCgtQtgtgQgQQQtQtCQQQtttttggCttCCQtCCCtggagtQQQCCCtQQtQtQttQCtCtt 

• • * I * 'II * * * 

13154 1317! 

13172 

Sspl Fokl 

I I 

CQCtgagQgQtQcttcccQttcQQtattQQQQgtQttQtctQQtgcQttQtctcQtcctQQQgtQttcQQgQggttctgg 13280 
gtgQctctctQtgQQgggtQQgttQtQQttttcQtQQtagottQcgtaQtQgQgtQggQtttcQtQQgttctccQQgacc 

• 'I' * *l* * * 

13223 13254 
Plel Sspl 

I I 

gQttgtggagtcttQQQccctQtttQtggccctQQtactgctQgtCQQgQCCQgQtQQQQCttgccctctctQtQtgtgQ 13360 
ctQQCQCctcQgQQtttgggQtQQQtQccgggQttQtgQcgQtcQgttctggtctQttttgQQcgggQgQgQtQtQCQCt 

I • • • • t a . I 

13288 13360 
Bglll |sp45I 

QtQttcQCtQgQtctQtttQtgagQgQQtggttgaQtggtgtQtCQCttgQQQtQtQCQtttgtgQCQgcgQtQtggaQg 13340 

tQtQQgtgQtCtQgQtQQQtQCtCtCttQCCQQCttQCCQCQtQgtgiQCtttQtQtgtQaQCQCtgtCgCtQtQCCttC 

I • • • « I • • 

13370 13423 

XbQl SfoNI AvqII 

I I I 

itgcgQQtgQtQggQQQCQQgcctttQtttctQgQCQCCtttcQtttgtttgttgtttQgcQgQQQttgcQtcttttggQ 13520 

QQCgCttQCtQtCCtttgttCggQQQtQQQgatCtgtggQQQgtQQQCQQQCQQCQQQtCgtCtttQQCgtQgaQQQCCt 

■ • I « 0 • |« I • 

13470 13509 13518 

BsnAI Sspl 

I I 

CCtQQCCtgttQQQCttaQCQtQCttQgQgagQCttgQtCtQttgQQQCQQtQtCttgQQttQQQtQttQQQgQCgQCCC 13600 

ggQttggQCQQtttgaQitgtQtgQQtctctctgagctQgQtQQctttgttQtQgQQcttQQtttQtQQtttctgctggg 

• • I « e o I • « 

13550 13584 



Fig. Iz 
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Pad 

Vspl SnaBI 

Bspl407I Asel BsqAI 

I II I 

CdttCttQQQtQtgtQCQQQtQtCtggQttattQQttQQQtCgttCCCQtCQQCtgtQQCQtQCgtQQgQQQgQCtgCQQ 13680 

QtgQgQQtttQtQCQtgtttQtQgQcctQQtQQttQQtttagcQQgggtQgttgQCQttgtQtgcattctttctgQcgtt 
. I . -11 • • • I • 

13613 13631 13662 

13531 13662 

13632 

BqiuHI 

SsDl Bsu36I BsqB! Mboll 

I I III 

tcQQQiQtttQQggQttcgtggtQttQgtccQCCtgaggtQQttgQtgQttgggQtccgQtQgQQgQtgQaQQtQtgctg 13760 

QgtttQtttQQttCCtQQgCQCCQtQQtCQggtggQCtCCQttQQCtQCtQQCCCtQggCtQtCttCtQCttttQtQCgQC 

I . . • I • I • I • I • 

13584 13713 13728 13743 . 

''''' 

I 

gatQQCQttgtCQQQQCtQtQQQtgQtQQttgtQQtQQQgQtQQtQQQgggQQtQQQQttQQCQQtttCtggggQCtQgC 13840 
CtQttgtQQCQgttttgQtQtttQCtQttQQCattQtttCtQttQtttCCCttQttttQQttgttQQQgQCCCCtgQtCg 

• • • • • • *l' 

13838 

Avail 13838 
Aflll PpuHI Bqlll SfaNI Agel 

I II 1 II 

gcttQQgQQctatcQggtccttQQQQtcQgQtctQtQQCQQgtgQttctgQtQQtQQtgQtQgatcQgQigctQgiQccg 13920 
cgQQt tct tgatogtccaggaat 1 1 tagtctagatat tgt tCQCtQagactat tat lactate tagtctacgatcQiggc 
I • II • I' • • • 1*1* 

13842 13855 13869 13908 13917 

13856 Hfel Spel 

1 1 
gtggtitgQCQCttcctcQQggggggQQttQtclQtcQCQtcQQttgagQttQttcggQQtCQQCQgcQCtQgttgtctg 14000 

cQCCQQQCtgtgQQggQgitccccccttQQtQgQtQgtgtQgttQQCtctQQtQQgccttQgttgtcgtgatcQQCQgQC 

• • • • I • • I • • 

13952 13989 
Styl 
KboII 
Ksp632I 
Earl 

EcoRII Sap I Avrll 
I 111 I 

aaagctcttgagttatcacaaattttaQtgaQggaagttoataaQgaccaggQcaggctcttcctQggagaaggagcagg 14080 
tttcgagQQCtcaatagtgtttaaaattacttccttcQattatttctggtcctgtccgagQaggatcctcttcctcgtcc 

I • III' I 
14048 14057 14063 
14058 

. 1 14058 

Fiq. laa 14059 

J 14063 
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PstI 

Nspl BspHI 
Nhel AvqII 
Cqc8I SFqNI PpuHI 

II I III I 

QgctQtgctQgcQtgtiQtgQtgccacQttQggQcctgcQgttQQttattQtQQttccggtttgQQlQtQQCQgQtgtQQ M160 
tcgQtQcgQtcgtQCQatQctQcggtgtQQtcctg^QcgtcaQttQQtQQtQttQQggccQQQCttQtQttgtctQCQtt 

14087 14100 14111 
14087 14112 
14091 14114 

14116 

Ssp! 

Hindi! Xinnl Eco571 Tsp45l 

I I I I I 

tiggtcQQcgQgQQttgQQQQtQttcccttcQgQggtQtcQttQgtQggtQQQQQQttQggQQQtgtgQCQcagQtcctt 14240 
QQCCQgttgctcttaacttttQtQQgggQQgtctccQtagtQQtcQtccQttttttaQtcctttQCQCtgtgtctQggQQ 

I • I I I • • • • I • 

14164 14177 14188 14226 
14180 BsinBL 
ScQl Xcnl Bsml BsnAI 

I I II 

QQtagggtQQQQgtQctgttcQQtggQQQtccQQQttcQQCQtggatQggQQQtQtggQQtgcgQgQcgttQQtQtggQg 14320 

ttQtCCCQttttCQtgQCQQgttQCCtttQqgtttQQgttgtQCCtQtCClttQtQCCttQCgCtCtgCQQttQtQQCtC 
•l » • • 1*1* * 

14252 4271 14298 141304 

141304 

XmnI 
HboII 

I I 

tgQQttgQQtgatQQgtctQttggQttggtQCQttgtgatQtggQQggagctQtcggtQQQtcQgQQgQQQCtgttctQc 14400 
QcttQQCttQctQttcQgatQQcctQQccQtgtaacQCtQtQccttcctcgatQgccQtttQgtcttcttcgQCQogQtg 

• • • • • *li* * 

14385 
14388 

HslI Fokl 
I I 
QtgQQcactQtagtgttQtQQgQQttQCQtQCttgQttggggQtgQtgQtgttgttttQQtttccQQQQttQtQCCtQCQ 14480 

tQCttgtgQtQtCQCQQtQttCttQQtgtQtgQQCtQQCCCCtQCtQCtQCQQCQQQQttQQQggttttQQtQtggQtgt 

j» a o aj • • • • 

14406 14441 

XbQl 

I 

QtcQctccgQQttggtctagQQtQctttQtctQtQtaagttQtQttggQQQgQttgQQgtQtQQtQtCQCttQQQQCttc 14560 

tQgtgQggcttQQccogQtcttQtgQQQtQgQtQtQttcQQtQtaQCCtttctQcattcatattQtQgtgQQtittgQQg 

• I • • < • • • * 

Ffg. loo 
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SfaNI SfaNI 

toQtcctqcQtcQQcagQQttQtQtctQQtttcaQQQgatgcgtQttgtactQtQQtggQQCCtQgtgQQgttgttttQt 14640 
attQggQcgtQgttgtcttaQtQtQgQttQOogttttctQcgcQtQQCQtgaiQttaccttggQtCQCttCQQcaQQQtQ 

14568 14598 

Tsp45I HboII lft>oJI 
11 1 

CQQQQCttQQQQgattgtCQCtCttggQQgQQQQlQQtCtQttQQQQtggQtCQttttQtCQOQgQagQQQQQtQQtgOQ 14720 
gtlttgQQttttCtOQCQgtgQgaQCCttCttttQttQgQtQQttttQCCtQgtQQQQtQgtttCttCtttttQttQCtt 

mi 14667 14705 

Eco31I 

BsPiAI 

BspHI BsqI Ndel 

I II 
tggttQcatcQtgQQQtcQQQgQQggggQQQgagQttQtggQgttQtgQgQccQtatCQtQtggcQttQCQQQtttttgg 14800 

QccaatgtagtQctttQgtttcttcccctttctctQQtQcctcQQtoctctggtQtQgtQtQccgtQQtgtttQQQaQCC 

14729 14768 14778 

14768 
14768 
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